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PREFACE. 


This edition differs from the first l ) by the new arrangement 
of the contents of several chapters, some of which have been completed 
by more recent results, and by the suppression of a number of errors, 
obligingly pointed out by Mr. V. Jarnik, which formed the object 
of the two pages of Errata in the first edition. It is probable that 
fresh errors have slipped in owing to modifications of the text, but 
the reader would certainly find many more, if the author had not 
received the valuable help of Messrs J. Todd, A. J. Ward and 
A. Zygmund in reading the proofs. Also, Mr. L. C. Young has greatly 
exceeded his role of translator in his collaboration with the author. 
To all these I express my warmest thanks. 

This volume contains two Notes by S. Banach. The first 
of them, on Haar’s measure, is the translation (with a few 
slight modifications) of the note already contained in the French 
edition of this book. The second, which concerns the integration 
in abstract spaces, is published here for the first time and com- 
pletes the considerations of Chapter I. 

The numbers given in the bibliographical references relate 
to the list of cited works which will be found at the end of the book. 
The asterisks preceding certain titles indicate the parts of the book 
which may be omitted on first reading. 

S. Saks. 

W arszawa - Zoliborz , July, 1937. 


l ) S. Saks, Theorie de VIntigraU, Monografie Matematyczne, Volume II, 
Waniawa 1933. 



FEOM THE PREFACE TO THE FIRST EDITION. 


The modern theory of real functions became distinct from clas- 
sical analysis in the second half of the 19-th century, as a result of 
researches, unsystematic at first, which dealt with the foundations 
of the Differential Calculus or which concerned the discovery of 
functions whose properties appeared to be very strange and un- 
expected. 

The distrust with which this new field of investigation was 
regarded is typified by the attitude of H. Poincar4 who wrote:“ Autre- 
fois quand on inventait une fonction nouvelle , c'etait en vue de quelque 
but pratique ; aujourd’kui on les invente tout expres pour mettre en defaui 
les raisonnements de nos pbres et on n'en tirera jamais que celd". 

This view was by no means isolated. Ch. Hermite, in a letter 
to T. J. Stieltjes, expressed himself in even stronger terms: u Je me 
ddtourne avec effroi et horreur de cette plaie lamentable des fonctions 
qui n'ont pas de ddrivdes". Researches dealing with non-analytic 
functions and with functions violating laws which one hoped were uni- 
versal, were regarded almost as the propagation of anarchy and 
chaos where past generations had sought order and harmony. Even 
the first attempts to establish a positive theory were rather sceptically 
received: it was feared that an excessively pedantic exactitude 
in formulating hypotheses would spoil the elegance of classical 
methods, and that discussions of details would end by obscuring 
the main ideas of analysis. It is true that the first researches hardly 
went beyond the traditional, formal apparatus, fixed by Cauchy 
and Riemann, which was difficult to adapt to the requirements 
of the new problems. Nevertheless, these researches succeeded in 
opening the way to applications of the Theory of Sets to Analysis, 
and — to quote H. Lebesgue’s inaugural lecture at the College 
de France — u ihe great authority of Camille Jordan gave to the new 
school a valuable encouragement which amply compensated the few 
reproofs it had to suffer". 

R. Baire, E. Borel, H. Lebesgue — these are the names which 
represent the Theory of. Real Functions, not merely as an object of re- 
searches, but also as a method, names which at the same time recall 
the leading ideas of the theory. The names of Baire and Borel 
will be always associated with the method of classification of 
functions and sets in a transfinite hierarchy by means of certain 
simple operations to which they are subjected. Excellent accounts 



of this subject are to be found in the treatises: Ch. J. de la Valine 
Poussin, Fonctions d' ensemble, Integrate de Lebesgue, Classes de Baire , 
1916, F. Hausdorff, Mengenlehre, 1927, H. Hahn, Theorie 
der reellen Funktionen, 1933 (recent edition), C. Kuratowski, third 
volume of the present collection, and finally in the book of W. Sier- 
piriski, Topologja ogolna (in Polish), and its English translation, 
Oenerhl Topology, to be published in 1934 by the Toronto Uni- 
versity Press. 

The other line of researches, which arises directly from the 
study of the foundations of the Integral Calculus, is still more in- 
timately connected with the great trains of thought of Analysis in 
the last century. On several occasions attempts were made to gen- 
eralize the old process of integration of Cauchy -Riemann, but it 
was Lebesgue who first made real progress in this matter. At the 
same time, Lebesgue’s merit is not only to have created a new and 
more general notion of integral, nor even to have established its 
intimate connection with the theory of measure: the value of his 
work consists primarily in his theory of derivation which is parallel 
to that of integration. This enabled his discovery to find many 
applications in the most widely different branches of Analysis and, 
from the point of view of method, made it possible to reunite the 
two fundamental conceptions of integral, namely that of definite 
integral and that of primitive, which appeared to be forever 
separated as soon as integration went outside the domain of 
continuous functions. 

The theory of Lebesgue constitutes the subject of the present 
volume. While distinguishing it from that of Baire, we have no 
wish to erect an artificial barrier between two streams of thought 
which naturally intermingle. On the contrary, we shall have frequent 
occasion, particularly in the last chapters of this book, to show expli- 
citly how Lebesgue’s theory comes to be bound up not only with 
the results, but also with the very methods, of the theory of Baire. 
Is not the idea of Denjoy integration at bottom merely a striking 
adaptation of the idea which guided Baire ? Where Baire, by repeated 
application of passage to the limit, widened the class of functions, 
Denjoy constructed a transfinite hierarchy of methods of integration 
starting with that of Lebesgue and whose successive stages 
are connected by two operations: one corresponding exactly to the 
generalized integral of Cauchy and the other to the generalized inte- 
gral of Harnack-Jordan. 



VI 


Now that the Theory of Eeal Functions, while losing perhaps 
a little of the charm of its first youth, has ceased to be a “new” science, 
it seems superfluous to discuss its importance. It is known that the 
theory has brought to light regularity and harmony, unhoped for 
by the older methods, concerning, for instance, the existence 
of a limit, a derivative, or a tangent. It is enough to mention the 
theorems, now classical, on the behaviour of a power series on, or 
near, the boundary of its circle of convergence. Also, many 
branches of analysis, to cite only Harmonic Analysis, Integral Equa- 
tions, Functional Operations, have lost none of their elegance where 
they have been inspired by methods of the Theory of Real Functions. 
On the contrary, we have learnt to admire in the arguments not only 
cleverness of calculation, but also the generality which, by an apparent 
abstraction, often enables us to grasp the real nature of the problem. 

The object of the preceding remarks has been to indicate the 
place occupied by the subject of this volume in the Theory of Real 
Functions 1 ). Let us now say a few words about the structure of the book. 
It embodies the greater part of a course of lectures delivered by the 
author at the University of Warsaw (and published in Polish in 
a separate book 2 )), which has been modified and completed by 
several chapters. The reader need only be acquainted with a few 
elementary principles of the Theory of Sets, which are to l>e found 
in most courses of lectures on elementary analysis. Actually a sum- 
mary of the elements of the theory of sets of points is given in one 
of the opening paragraphs. 

Several pages of the book are inspired by suggestions and 
methods which I owe to the excellent university Lectures of my 
teacher, ‘ W. Sierpiriski, the influence of whose ideas has often 
guided my personal researches. Finally, I wish to express my warmest 
thanks to ail those who have kindly assisted me in my task, partic- 
ularly to my friend A. Zygmund, who undertook to read the manu- 
script. I thank also Messrs 0. Kuratowski and H. Steinhaus for 
their kind remarks and bibliographical indications. 

8 . Saks. 

Warszawa, May, 1933. 

1 ) In this preface, I made no attempt to write a history of the early 
lays of the theory, and still less to settle questions of priority of discovery, 
lut, since an English Edition of this hook is appearing now, 1 think I ought to 
lention the name of W. H. Young, whose work on the theory of integration 
tarted at the same period as that of Lehesgue. 

2 ) Zarys teorji calki, Warszawa 1930, Wydawnictwo Kasy im. Mianow- 
;iego, Instytutu Popierania Nauki. 



CHAPTER I. 


The integral in an abstract space. 

$ 1. Introduction. Apart from functions having as argument 
a variable number, or system of n numbers (point in n-dimensional 
space), we shall discuss in this book functions for which the inde- 
pendent variable is a set of points. Functions of this kind have 
occurred already in classical Analysis, in several important particular 
cases. But they only began to be studied in their full generality 
during the growth of the Theory of Sets, and in close relation to 
the parts of Analysis directly based on that theory. 

If we are, for instance, given a function f(x) integrable on every 
interval, then by associating with each interval / the value of the 
integral of f(x) over 1, we obtain a function F(l) that is a function 
of an interval. Similarly, by taking multiple integrals of functions 
/( x v x 2 , x„) of n variables, we are led to consider functions of 

more general sets lying in spaces of several dimensions, the argu- 
ment 1 of our function F{1) being now replaced by any set for which 
the integral of our given function f(x v x 2 , ..., x„) is defined. 

We dwell on these examples in order to emphasize the natural 
connection between the notion of integral (in any sense) and that 
of function of a set. Needless to say, there are many other examples 
of functions of a set. Thus in elementary geometry, we have for 
instance, the length of a segment or the area of a polygon. The 
class of values of the argument of these two functions (the length 
and the area) is in the first case, the class of segments and in the 
second, that of polygons. The problem of extending these classes 
gave birth to the general theories of measure, in which the notions 
of length, area, and volume, defined in elementary geometry for 
a restricted number of figures, are now r extended to sets of points 
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of much greater diversity. It is, nevertheless, remarkable that these 
researches arose far less from problems of Geometry than from 
their connection with problems of Analysis, above all with the 
tendency to generalize, and to render more precise, the notion of 
definite integral. This connection has occasionally found expression 
even in the terminology. Thus duBois-Eeymond called integrable 
the sets that to-day are said to be of measure zero in the Jordan sense. 

The theories of measure have, in the course of their development, 
been modified in accordance with the changing requirements of the 
Theory of Functions. In our account, the most important part will 
be played by the theory of H. Lebesgue. 

Lebesgue’s theory of measure has made it possible to dis- 
tinguish in Euclidean spaces a vast class of sets, called measurable, 
in which measure has the property of complete additivity — by this 
we mean that the measure of the sum of a sequence, even infinite, 
of measurable sets, no two of which have points in common, is equal 
to the sum of the measures of these sets. The importance of this 
class of sets is due to the fact that it includes, in particular, (with 
their classical measures), all the sets of points occurring in problems 
of classical Analysis, and further, that the fundamental operations 
applied to measurable sets lead always to measurable sets. 

It is nevertheless to be observed that the ground was prepared 
for Lebesgue’s theory of measure by earlier theories associated 
with the names of Cantor, Stolz, Harnack, du Bois-Reymond, 
Peano, Jordan, Borel, and others. These earlier theories have, 
however, to-day little more than historical value. They, too, were 
suitable instruments for studying and generalizing the notion of 
integral understood in the classical sense of Riemann, but their 
results in this direction have been largely artificial and accidental. 
It is only Lebesgue’s theory of measure that makes a decisive step 
in the development of the notion of integral. This is the more re- 
markable in that the definition of Lebesgue apparently requires 
only a very small modification of a formal kind in the definition 
of integral due to Riemann. 

To fix the ideas, let us consider a bounded function f(x, y) 
of two variables, or what comes to the same thing, a bounded func- 
tion of a variable point defined on a square K n . In order to deter- 
mine its Riemann integral, or more precisely, its lower Riemann- 
Darboux integral over K 0 , we proceed as follows. We divide the 
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square K 0 into an arbitrary finite number of non-overlapping 
rectangles K u R 2 , R„, and we form the sum 

( 1 . 1 ) ivi-m(R t ) 

1=1 

where v, denotes the lower bound of the function / on R t , and m(R f ) 
denotes the area of R ( . The upper bound of all sums of this form is, 
by definition, the lower Riemann-Darboux integral of the function / 
over K 0 . We define similarly the upper integral of / over K 0 . If 
these two extreme integrals are equal, their common value is called 
the definite Riemann integral of the function / over K 0 , and the 
function / is said to be integrable in the Riemann seme over K 0 . 

The extension of measure to all sets measurable in the Lebesgue 
sense, has rendered necessary a modification of the process of Rie- 
maiin-Darboux, it being natural to consider sums of the form (1.1) 
for which [Ri) f is a subdivision of the square K 0 into a finite 

number of arbitrary measurable sets, not necessarily either rectangles 
or elementary geometrical figures. Accordingly, m(Rj) is to be un- 
derstood to mean the measure of R The Vj retain their former 
meaning, i. e. represent the lower bounds of / on the corresponding 
gets R, . We might call the upper bound of the sums (l.l) interpreted 
in this way the lovper Lebesgue integral of the function / over K 0 . 
Rut actually, this process is of practical importance only for a class 
of functions, called measurable , and for these the number obtained 
as the upper bound of the sums (l.l) is called simply the definite 
Lebesgue integral of / over /t 0 . What is important, is that the func- 
tions which are measurable in the sense of Lebesgue, and whose 
definition is closely related to that of the measurable sets, form 
a very general class. This class includes, in particular, all the func- 
tions integrable in the Riemann sense. 

•Apart from this, the method of Lebesgue is not only more 
general, but even, from a certain point of' view, simpler than that 
of Riemann-Darboux. For, it dispenses with the simultaneous 
introduction of two extreme integrals, the lower and the upper. 
Thanks to this, Lebesgue’s method lends itself to an immediate 
extension to unbounded functions, at any rate to certain classes 
of the latter, for instance, to all measurable functions of constant 
sign (cf. below § 10). Finally, the Lebesgue integral renders it per- 
missible to integrate term by term sequences and series of functions 
in certain general cases where passages to the limit under the in- 
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tegral sign were not allowed by the earlier methods of integration. 
The reason for this is to be found in the complete additivity of Le- 
besgue measure. The fundamental theorems of Lebesgue (cf. below 
§ 12) stating the precise circumstances under which term by term 
integration is permissible, are justly regarded by Ch. J. de la Vall6e 
Poussin [I, p. 44] as one of the finest results of the theory. 

Lebesgue’s theory of measure has, in its turn, led naturally 
to further important generalizations. Instead of starting with area, 
or volume, of figures, we may imagine a mass distributed in the 
Euclidean space under consideration, and associate with each set 
as its measure, (its “weight” according to Oh. J. de la Valine 
Poussin [I, Chap. VI; 1]), the amount of mass distributed on the set. 
This, again, leads to a generalization of the integral, parallel to 
Lebesgue’s, known as the Lebesgue-Stieltjes integral. In order 
to present a unified account of the latter, we shall consider in this 
chapter an additive class of measurable sets given a priori in an 
arbitrary abstract space. We shall suppose further, that in this 
class, a completely additive measure is determined for the mea- 
surable sets. These hypotheses determine completely a corresponding 
method of integration in the Lebesgue sense. All the essential prop- 
erties of the ordinary Lebesgue integral, except at most those im- 
plying the process of derivation, remain valid for this abstract integral. 
From this point of view, in a more or less general form, the Lebesgue 
integral has been studied by a number of authors, among whom 
we may mention J. Radon [1], P. ,J. Daniell [2], O. Nikodym [2] 
and B. Jessen [1], For further generalizations (of a somewhat 
different kind) see also A. Kolmogoroff [1], S. Bochner [1], 
G. Fichtenholz and L. Kantorovitch [1], and M . Gowu- 
rin [1]. 

# 2. Terminology and notation. Given two sets A and B, 
we write AC B when the set A is a subset of the set B, i. e. when 
every element of A is an element of B. When we have both AQ B 
and B(ZA, i. e. when the sets A and B consist of the same ele- 
ments, we write A — B. Again, a e A means that a is an element 
of the set A (belongs to A). By the empty set, we mean the set without 
any element; we denote it by 0. A set A is enumerable if there exists 
an infinite sequence of distinct elements a x , a 2 , ..., a„, ... consisting 
of all the elements of the set A. 
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Given a class *21 of sets, we call sum of the sets belonging to 
this class, the set of all the objects each of which is an element of 
at least one set belonging to the class 91. We call product, or common 
part, of the sets belonging to the class 91, the set of all the objects 
that belong at the same time to all the sets of this class. We call 
difference of two sets A and B, and we denote by A — B, the set 
of all the objects that belong to A without belonging to B. 

Given a sequence of sets {A,,} — a finite sequence A 1? A 2 , ..., A„, 
or an infinite sequence A,, A 2> ..., A „, ... — we denote the sum by 

n 

2A„ by % A „ or by Aj A a -f- ... -f- A„, in the finite case, and by 

i /- 1 

oo 

2A„ by X A h or by A 1 + A 2 + ...+ A„4- ... in the infinite case. 

i 1 

Similarly, merely replacing the sign 2 by IT, we have the expres- 
sion for the product of a sequence of sets. If the sequence {A,,} is 
infinite, we call upper limit of this sequence, the set of all the ele- 
ments a such that a<- A „ holds for an infinity of values of the index n. 
The set of all the elements a belonging to all the sets A„ from some « 
(in general depending on a) onwards, we call lower limit of the se- 
quence \A„\. The upper and lower limits of the sequence (A„'„ 
we denote by limsupA,, and lim inf A„ respectively. We have 

n n 

oo oo oo oo 

(2.1) lim inf A„ = X FI A„C 1 1 X A„ — lim sup A„. 

n k 1 n^k h— 1 n h n 

If lim sup A n ~ lim inf A, ls the sequence {J.„} is said to be convergent; 

a n 

its upper and lower limits are then called simply limit and denoted 
by lim A„. 

II 

If, for a sequence {A„) of sets, we have A„CA, l+ i, for each n, 
the sequence {A„j is said to be ascending, or non-decreasing ; if, for 
each n, we have A /1+ iC A,„ the sequence (A„) is said to be descending 
or non-increasing. Ascending and descending sequences are called 
monotone. We see directly that every monotone sequence is con- 
vergent, and that we have lim A „ = 2' A „ for every ascending se- 

n n 

quence (A„), and lim A„— TLA,, for every descending sequence (A,,}. 

n u 

Finally, given a class (E of sets, we shall often call the sets 
belonging to C£, for short, sets ((E). The class of the sets which 
are the sums of sequences of sets ((E) will be denoted by (E n . The 
class of the sets which are the products of such sequences will 
be denoted by (E,> (see F. Hausdorff [II, p. 83]). 
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$ 3. Abstract space X. In the rest oi this chapter, a set A” 
will be fixed and called space. The elements of -V will be called point s. 
It A is any set contained in A r the set A' — A will be called com- 
plement of A with respect to A'; the expression “with respect to A’” 
will, however, generally be omitted, since sets outside the space X 
will not be considered. The complement of a set A will be denoted 
by CA. We evidently have, for every pair of sets A and B, 

(3.1) A — B — A- CB, 
and for every sequence |A„) of sets 

n x„ = cscx „ , x x„ — c/7 cx „ , 

, v u » n n 

(3.2) 

lim sup X n = C lim inf CX „ , lim inf X„ = C lim sup CX„. 

ii it n n 

In the space A' we shall consider functions of a set, and functions 
of a point. The values of these functions will always be real numbers, 
finite or infinite. A function will be called finite , when it assumes 
only finite values. 

To avoid misunderstanding, let us agree that when infinite func- 
tions are subjected to the elementary operations of addition, subtrac- 
tion etc., we make the following conventions: o-f ( ±°°)=( +co )+a= ±oo 

for (-f<X>)-f( 00) = ( CX3)-f(-foo)==(±CX5) — (±oo)=0; 

o • (±o°)=(± 00 ) • «= ±°° and a-(± 00 )=(± c, °)- a=T°o, according 
M «>0 or o<0; 0 • ( ±oo)=( ±oo) . 0=>0; a/(±oo)=0; a/0=-\-oo. 

We call characteristic function c K (x) of a set E, the function 
(of a point) equal to 1 at all points of the set, and to 0 everywhere 
else. The following theorem is obvious: 

(3.3) If E—X E,„ and E r Ei,= 0 whenever i^k, then c K (x)=X c » (*)• 

If {E„) is a monotone sequence of sets , the sequence of their 
characteristic functions is also monotone , non-dec teaming or non-in- 
creasing according as the sequence [E„) is ascending or descending . 

If {Eh) is any sequence of sets } A and B denoting its upper 
and lower limits respectively \ we have 

Cj (a?)=lim sup c A , (#), and o, B (x) = lim inf c A , (x) ; 

ii J,t H 'n 

so that , in order that a sequence of sets {E„) converge to a set E y 
it is necessary and sufficient that the sequence of their character- 
istic functions {c A ^(a?)} converge to the function \c K (x)\. 



Additive classes of sets. 


7 


[Ml 


A function assuming only a finite number of different values 
on a set E is called a simple function on E. If v x , v„ are all 
the distinct values of a simple function f(x ) on a set E, the 
function f(x) may on E be written in the form 

f(x)—Zvi,c h .(x) where E=E 1 +E 2 -\-...+Ei, and 2J,-i?,=0 for i=\=j. 
* 

The function / given by this formula over the set E will be denoted 
by E x , v t , E 2 ', ...; v„, E„). 

The notion of characteristic function is due to Oh. J. de la Vallee 
Poussin [1] and [I, p. 7]. 

# 4. Additive classes of sets. A class 9 C of sets in the space A' 
will be called additive if (i) the empty set belongs to SC, (ii) when 
a set X belongs to 9C so does its complement GX. and (iii) the 
sum of a sequence [X„) of sets selected from the class SC, belongs 
also to the class SC. 

The classes of sets, additive according to this definition, are sometimes 
termed completely additive. We get the definition of a class of sets additive In 
the weak sense if we replace the condition (iii) of the preceding definition 
by the following: (iii-bis) the mm of two seta belonging to 9E also belongs to if. 

The sets of an additive class SC will be called sets measurable (SC), 
or, in accordance with the definition given in § 2 (p. 5), simply 
sets (SC). We see at once that, on account of the conditions (i) and (ii), 
the space A', as complement of the empty set, belongs to every 
additive class of sets. Making use of the relations (2.1), (3.1), and (3.2), 
we obtain immediately the following: 

(4.1) Theorem. If SC is an additive class of sets, the sum, the 
product, and the two limits, upper and lower, of every sequmce of sets 
measurable (SC), and the difference of two sets measurable (SC), are also 
measurable (SC). 

In later chapters we shall consider certain additive classes of 
sets that present themselves naturally to us, in connection with 
the theory of measure, in metrical or in Euclidean spaces. In the 
abstract space A', about which we have made practically no hypo- 
thesis, we can only mention a few trivial examples of additive 
classes of sets, such as the class of all sets in A', or the class of all 
finite or enumerable sets and their complements. Let us still mention 
one further general theorem: 
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(4.2) Theorem. Given any class SO) of sets in X, there exists 
always a smallest additive class of sets containing 90), i. e. an 
additive class 91 0 3 90) contained in every other additive class that 
contains 90). 

For let 9) 0 be the product of all the additive classes that con- 
tain 90). Such classes evidently exist, one such class being the class 
of all sets in X. We see at once that the class 9) 0 thus defined has 
the required properties. 

$ 5. Additive functions of a set. In the rest of this chapter 
we suppose that a definite additive class 96 of sets is fixed in the space X. 
In accordance with this hypothesis, we may often omit the symbol 96 
in our statements, without causing any ambiguity. 

A function of a set, 0(X), will be called additive function of 
a set (96) on a set E, if (i) E is a set (96), (ii) the function 0( X ) 
is defined and finite for each set A' C E measurable (96), and if 
(iii) 0(2 X„) — X 0(X„) for every sequence {X,,} of sets (96) con- 

// /i 

tained in E and such that X, ■ X k —0 whenever i For simplicity, 
we shall speak of an “additive function” instead of an “additive 
function of a set (96)” whenever there is no mistaking the meaning. 
An additive function of a set (96) will be called monotone on E 
if its values for the subsets (96) of E are of constant sign. A non- 
negative function <P(96) additive and monotone, will also be termed 
non-decreasing, on account of the fact that, for each pair of sets 
A and B measurable (96), the inequality A(Z B implies 0(B) — 
— 0(A) -f 0(B — A) ^ 0(A). For the same reason, non-positive 
monotone functions will be termed non-increasing. 

(5.1) Theorem. If 0(x) is an additive function on a set E, then 

(5.2) $(limX„) = lim 0(X„) 

It ll 

for every monotone sequence {X„) of sets (96) contained in E. If 0(X) 
is a non-negative monotone function, then 

(5.3) 0(liminf X„)^liminf 0(X„) and #(lim sup X„)^lim sup<P(X„) 

n n n n 

for every sequence |X„) of sets (96) in E. 
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Proof. Let be a sequence of sets (9 E) contained in E. 

If (X„) is an ascending monotone sequence, then 

oo oo 

lim X„ = V X„ = X x + 2 (X„ +1 — X„), 

it 1 n— 1 

and consequently, 0(X) being an additive function on E, 

oo 

0(limX„) = 0(X x ) + v 0(X n+l — X n ) = 

n /i— 1 

= lim [0(X x ) +"z<D(X k+t — X*)] = lim <P(X„). 

n A- 1 n 

If {X,,} is a descending sequence, the sequence {E — X,,} is 
ascending, and, by the result already proved, 

0(E)— <P(limX,J=0[lim(E— X„)] = lim0(-E— X„)=0(E)— limO>(X„), 

n n ii n 

from which (5.2) follows at once. 

Finally, if (X,,} is any sequence, but 0(X) is a non-negative 
monotone function, we put 

(5.4) ¥„ = U X k for n = 1,2,... 

A ii 

The sets Y„ are measurable (SE) on account of (4.1), and form an 
ascending sequence. We therefore have, by the part of our theorem 
proved alread) 

(5.5) 0 (lim X„) = lim 0 ( ¥„). 

n n 

Now, it follows from (5.4) that Y„C X„, and so, 0(Y„)^. 0(X n ), 
for each n. On the other hand, lim inf X n = lim ¥„, and therefore 

it n 

the first of the relations (5.3) is an immediate consequence of (5.5). 
We establish similarly (or, if preferred, by changing X„ to E — X„) 
the second of these relations, and this completes the proof of the 
theorem. 

Every function of a set 0(X), additive on a set E, can easily 
bo extended to the whole space X. In fact, if we write, for instance, 
0 y (X) = 0{X-E) for every set X measurable (2E), we see at once 
that 0 x (X) is a function additive on the whole space X, that co- 
incides. with 0(X) for measurable subsets of E and vanishes for 
measurable sets containing no points of E. We shall call the func- 
tion <P x (X), thus defined, the extension of 0(X) from the set E to 
the space X. 
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$ 6. The variations of an additive function. The upper 

and lower bounds of the values that a function of a set 0(X), ad- 
ditive on a set E assumes for the measurable subsets of this set E, 
will be called upper variation and lower variation of the function 0 
over E, and denoted by W(<P; E) and W(0; E) respectively. Since 
every additive function vanishes for the empty set, we evidently 
have W(0;E)s^ W(<P; E). The number W(<P;E)+ |W(<Z>;E)| 

will be called absolute variation of the function 0 on E and denoted 
by W (0;E). 

(6.1) Theorem. If 0{ X) is an additive function on a set E, its 
variations over E are always finite. 

Proof. Suppose that W(<P; E) = -\- oo. We shall show firstly 
that there then exists a sequence {E„)„ ),•»,. of sets (9E) such that 

(6.2) E„QE„ mi for n> 1; W(<P; E„) = oo; \0(E„)\ > n — 1. 

For let us choose E 1 — E and suppose the sets E„ for n = 1,2, ...,k 
defined so as to satisfy the conditions (6.2). By the second of these 
conditions with n k, there exists a measurable set A C Eh such that 

(6.3) \0(A)\^\0(Eh)\ + k. 

If W('/>;A) = oo, we have only to choose E k +\ — A in order to 
satisfy the conditions (6.2) for n = k 1 . If, on the other hand, 
W( 0-,A) is finite, we must have W(0 ; E k — A) = + oo, and, by (6.3), 
0(Eh — A)\^t\0(A)\ — \0{Eh)\^k, so that the conditions (6.2) 
will be satisfied for n~k + 1, if we choose E k+ \ = E k — A. The 
sequence {E„\ is thus obtained by induction. 

Now, on account of Theorem 5.1 and of the third of the con- 
ditions (6.2), we should have the equality 0(]imE ll )=\im0(E„)=oo, 

n It 

and since every additive function of a set is, by definition, finite, 
this is evidently impossible. Q. E. D. 

It follows from the theorem just proved that every function 
0(X) additive on a set E is not only finite for the subsets (9E) 
of E, but also bounded ; in fact, the values it assumes are bounded 
in modulus by the finite number W(<P; E). 
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Theorem 6.1 can be further completed as follows: 

(6.4) Theorem. For every function F (X) additive on a set E, 
the variations W(<P;X), W( ®\X) and W(<P; X) are also additive 
functions of a set (9E) on E, and we have , for every measurable set 

JC-® 

(6.5) <P(X)=W(0;X) + W(0;X). 

Proof. To fix the ideas, consider the function Q x (X) = W($>-, X). 
Since this function is finite by Theorem 6.1, we have to show 
that for every sequence (X,,! of measurable sets contained in E, 
and such that X ( -X* = 0 whenever i=fck, 

(6.6) fl 1 (2I„) = 2fl 1 (X„). 

/f /I 

For this purpose, let us observe that for every measurable set XCXX„ 

n 

we have ^I) = 2l(II„)<Ifl 1 (X„), and hence 

/I ll 

(6.7) fl 1 (2l ll K^fi I (X„). 

n n 

On the other hand, denoting generally by Y„ any measurable set 
variable in X,„ we have £1,(^1,,)^ T„) = E0(T„), and 

n n n 

therefore also Dj(2X„)>2fli(I„). Combining this with (6.7) we 

n n 

get the equality (6.6). 

Finally, to establish (6.5), we observe that for every measurable 
subset Y of X we have 0(Y)=0(X)— 4>(X— Y)^O(X)— W(0;X), 
andsoW(<P;X)<$(X)— W(0; X). Similarly W( 0; XP*0(X)— W(0;X). 
These two inequalities give together the equality (6.5), and the 
proof of Theorem 6.4 is complete. 

It follows from this theorem that every function of a set 0(X) 
additive on a set E is, on E, the difference of two non-negative 
additive functions. The formula (6.5) expresses, in fact, 0(X) as 
the sum of two variations of 0(X), of which the one is non-negative 
and the other non-positive; this particular decomposition of an 
additive function of a set will be termed the Jordan decomposition. 
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We can now complete Theorem 5.1 as follows: 

(6.8) Theorem. If ®(X) is additive on a set E, we have 
tf'(lim X„) = lim $(X„) for every convergent sequence {X,,} of sets (9E) 

U II 

contained in E. 

In fact, making use of the Jordan decomposition, we may 
restrict ourselves to non-negative functions ®(X), and for these 
Theorem 6.8 follows at once from the second part of Theorem 5.1. 

$ 7. Measurable functions. Given an arbitrary condition, 
or property, ( F) of a point x, let us denote generally by E[( F)] 

A* 

the set of all the points x of the space considered that fulfill this 
condition, or have this property. Thus, for instance, if f(x) denotes 
a function of a point defined on a set E and a is a real number, 
the symbol 

(7.1) E[xeE-,f{x)> a] 

V 

denotes the set of the points x of E at which f(x) > a. 

A function of a point, f(x), defined on a set E , will be termed 
measurable (9E)> or simply junction (SE), if the set E } and the set (7.1) 
for each finite a, are measurable (9£). It is easy to see that 

(7.2) In order that a function f(x) be measurable on a measurable 
set E> it suffices that the set (7.1) should be so for all values of a be - 
longing to an arbitrary everywhere dense set R of real numbers (the 
same holds with the set (7.1) replaced by the set E[a?e2£; f(x)^a]). 

X 

In fact, for every real a, the set R contains a decreasing 
sequence cf numbers (r„) converging to a. We therefore have 

oo 

E[® f £; /(®) > «] = 2’ B[a; f B; / (®) > r„] and, each term of the sum 

a * n~ 1 x 

on the right being measurable by hypothesis, the same holds for 
the sum itself (cf. Theorem 4.1). 

Every function f{x) measurable on a set E, can be continued 
in various ways, so as to become a measurable function on the 
whole space X. For definiteness, we shall understand by the ex- 
tension of the function f(x) from the set E to the space X, the func- 
tion f 0 (x) equal to f(x) on E and to 0 everywhere else. For brevity, 
we shall often deal only with functions measurable on the whole 



Measurable functions. 


13 


LI 7] 

space X, but it is easy to see that all the theorems and the 
reasonings of this and of the succeeding, § could be taken relative 
to an arbitrary set (SE). 

The equations 

E [f(x) <«]=CE [f{x) > a], E [fix) ^ a] = /? E \f(x) > a — - L 

x .r x n t x l tlj 

E [f(x) <a] = CE [fix) ^ a], E[f(x)=a] = E [f(x)^a] • E [/(*) <a], 

X X X X X 

E[fix)<+oo]= y E[f(x)<n], E[f(x) > — oo] = f EL/(a?) > — »], 

x n 1 .v n 1 v 

E [/(*)= 4- oo]=CE[/(®) < -f-oo], E [fix) = — oo] = CE [/(*)> — oo] 

-V X \ A 

show that for every measurable function f(x) and for every number a, 
the left hand sides are measurable sets. Conversely, in the definition 
of measurable function, we may replace the set (7.1) by any one 
of the sets E[/(o?)^a], E[/(a?)^a} or E[/(#)<a]; this follows at 

.V A .V 

once from the identity 

oo 1 1 oo r 

E[/(«)>a]= ve fix)7?a+- = CE [fix) ^ a] = C II E /(:r)«?+- . 

A II 1 V ^ l // 1 A W 

To any function fix) on a set E, we attach two functions fix) 
and fix) on E, called, respectively, the non-negative part and the 
non-positive part of fix) and defined as follows: 

fix) = fix) or 0 according as fix) ^ 0 or fix) < 0, 

fix) — fix) or 0 according as fix) < 0 or f(x)^ 0. 

o 

We see at once, that in order that a function he measurable on a set E y 
it is necessary and sufficient that its two parts , the non-negative and 
the non-positive , be measurable . 

Returning now to the notions of characteristic function, and 
simple function introduced in § 3, we have the theorem: 

(7.3) Theorem . In order that a set E be measurable (90, it is ne- 
cessary and sufficient that its characteristic function be measurable . 
More generally , in order that , on a set I?, a simple function f(x) be 
measurable (9£), it is necessary and sufficient that , for each value 
of f(x), the points at which this value is assumed on E y should con- 
stitute a measurable subset of E . 

Another theorem, of great utility in applications, is the fol- 
lowing: 
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(7.4) Theorem. Every function f(x) that is measurable (9E) and 
non-negative on a set E, is the limit of a non-decreasing sequence 
of simple functions , finite, measurable and non-negative on E. 


In fact, if we write for each positive integer n and for xeE, 


i—1 


/»(*) = 


2 " 1 
n , 


if. < f(x) <± 1 % i < 2" • n, 

if /(*) > n, 


the functions f„(x) thus defined are evidently simple and non-negative, 
and, on account of Theorem 7.3, measurable on E. Further, as is 
easily seen, the sequence !/„(«)! is non-decreasing. Finally lim f„(x)—f(x) 

II 

for every xeE\ for, if f(x) <+oo, we have, as soon as » exceeds 
the value of f(x), the inequalities 0 <ff(x) — f„(x) <1 1/2", while, 
if f(x) = -f oo, we have /„(#) = n for n= 1 , 2, ..., and so 
lim f„{x) = + oo = f(x). 


j| 8. Elementary operations on measurable functions. 

We shall now show that elementary operations effected on measur- 
able functions always lead to measurable functions. 

(8.1) Theorem. Given two measurable functions f(x) and g(x), the sets 

E [f(x)>g(x)], E [/(*)> </(*)] and E[f(x) = g(x)], 

X .\ X 


are measurable . 

The proof follows at once from the identities 

n 


-f-oo 4-oo 

E [/<*)> </(*)]= 1 v.E |/(*)>£ 


n oo m 


■ E 


g(x) 


n 

m 


E[/>fif] = CE[ ? >/] and E [/ = </] = E [f>g]- E [</>/]. 


(8.2) Theorem. If the function f(x) is measurable, \f(x)\“ is also 
a measurable function. 

For a > 0, the proof is a consequence of the identity 
E[|/(*)|" > a] = E[/(a?) > o' ''] 4 E [/(*)< — o' «], 


which is valid for every a 0 , while for a < 0 its left hand side 
coincides with the whole space and therefore constitutes a mea- 
surable set. For a < 0, the proof is similar. 
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(8.3) Theorem. Every linear combination of measurable functions 
with constant coefficients represents a measurable function. 

The identities 

E [a • f(x) +/?><*] — E [/(#) > f or a > 0 , 

• .V X l a 

E[«./(®) + /3>a] = E[/(®)<— ^1 for «< 0, 

.V A l O J 

valid for every function f(x) and for all numbers a, u 4- 0, and ft, 
show, in the first place, that a. f(x) + ft is a measurable function, 
if f(x) is measurable. It follows further, from Theorem 8.1 and 
from the identities: 

E [« • / -f- /? • > a] = E /> — —g + — for a > 0 

x a Cl CC 

E[a-/ + /?-9 > «] = E / < — ^ g + - for «< 0 

X A U (X 

that if f(x) and g(x) are measurable functions, so is a • f(x ) + ji • g(x). 

(8.4) Theorem. The product of two measurable functions f(x) 
and g(x) is a measurable function. 

Measurability of the product f g is derived by applying Theorems 
8.2 and 8.3 to the identity fg — 4 [(/ + g) 2 — (/ — ?)*]• the com- 
pletion of the proof, by taking into account possible infinities 
of / and g, being trivial. 

(8.5) Theorem . Given a sequence of measurable functions (/»(#)!, 
the functions 

upper bound /,, (x), lower bound f„ (x), limsup/„(®) awd liminf/„(«) 

n n n » 

are also measurable. 

The measurability of h(x) — upper bound f„(x) follows from 

n 

the identity E [h(x) >a] = ^ E[7, 7 (a?)>a]. For the lower bound, the 

A l> A 

corresponding proof is derived by change of sign. 

Hence, the functions /t„(x)=upper bound [/„ i \ (x), f„ .>(x), ...] 
are measurable, and the same is therefore true of the function 
limsup fn(x) — Hm h„(x) = lower bound h„(x ). By changing the sign 

n n n 

of /»(*), we prove the same for lim inf. 
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§9. Measure. A function of a set fi(X) will be called a meas- 
ure (9E), if.it is defined and non-negative for every set (9E), and if 

p(2X H ) = 2p(X m ) 

n n 

for every sequence (X„) of sets (9c) no two of which have points 
in common. The number y{X) is then termed, for every set X 'mea- 
surable ($E), the measure (y) of X. If every point of a set E } except 
at most the points belonging to a subset of E of measure (/») zero, 
possesses a certain property V, we shall say that the condition V 
is satisfied almost everywhere ( y ) in E, or, that almost every (y) point 
of E has the property V. We shall suppose, in the sequel of 
this Chapter, that, just as the class 9c was chosen once for all, 
a measure y corresponding to this class is also kept fixed. Accordingly, 
we shall often omit the symbol (y) in the expressions “measure 
(/*)”, “almost everywhere (/*)”, etc. Clearly y(X)*^Ly(Y) for any 
pair of sets X and Y measurable (9c) such that X(2 i\ and 
y(XX,,)^. 2y(X„) for every sequence of measurable sets |X„). 

n n 

A measure may also assume infinite values, and is therefore not in gen- 
eral an additive function according to the definition of § 5. 

The results established in this chapter concerning perfectly arbitrary meas- 
ures will be interpreted in the sequel for more special theories of measure, (for 
instance, those of Lebesgue and Caratheodorv). For the present, we shall 
be content mentioning a few' examples. 

Let us take for 96, the class of all sets in a space -V. We obtain a trivial 
example of measure (9E) by writing t n(X)^0 identically, (or else n (JC)^-f-oo) 
for every set XC.X. Another example consists in choosing an element a in A' 
and writing t u(X) — 1 or n(X)= 0, according as aeX or not. In the case of an 
enumerable space A’, consisting of elements o lf a 2 , ...» a , h the general form 
of a measure /t(X) defined for all subsets X of A" is t n(X)= ^k„j,i(X) where 

n 

{kn} is a sequence of non-negative real numbers and f n (X ) is equal to 1 or 0 ac- 
cording as a n eX or not. It follows that every measure defined for all subsets 
of an enumerable space, and vanishing for the sets that consist of a single point, 
vanishes identically. The similar problem for spaces of higher potencies is much 
more difficult (see S. 11 lam [* 1 ]). For a space of the potency of the continuum 
see also S. Banach and C. K uratowski [1], E. Szpilrajn [1], W. Sierpihski 
[I, p. 60], W. Sierpinski and E. Szpilrajn [1]. 

We shall now prove the following theorem analogous to The- 
orem 5.1: 

(9.1) Theorem . If {X,,} is a monotone ascending sequence of 
measurable sets , then lim = j*(lim X„). The same holds 

monotone descending sequences provided , however , that piX^^oo.^' 
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More generally , for every sequence (Z n ) of measurable sets , 

(9.2) y (lim inf X„) <; lim inf n (X„) 

n n 

and , if further y(¥X„) f 00 , 

n 

(9.3) . ft (lim sup X„) ^ liin sup // ( X„), 

n n 

so that, in particular , if the sequence {Zn) converges and its sum has 
finite measure, lim y (X„) = y (lim X„). 

n n 

Proof. For an ascending sequence (X„} n =t, the equation 
lim^(Z n ) = /<(HmZ n ) follows at once from the relation 

n n 

00 00 

lim Xn == Xn — X j -(- y. ( Xn f I ~ ~ Xn), 

n n—l n 1 

and if the sequence {X n } is descending and y (Zj) 4 1 oo, then the 
measure y(X) is an additive function on the set Zj and consequently 
the required result follows from Theorem f>.l . 

In exactly the same way, if for an arbitrary sequence {X n } 
of measurable sets, ^ X„ is of finite measure, the measure y(X) 

71 

is an additive function on this set, and the two inequalities (9.2) 
and (9.3) follow from Theorem 5.1. To establish the first of these 
inequalities without assuming that the sum of the sets X„ has finite 
measure, we write as in the proof of Theorem 5.1 

¥„ = II Z*. 

k n 

Since the sequence is ascending, and V„(7 X„ for every n, we have 
/<(lim inf X„) — /u(lim Y„) = lim y( ¥,,) lim inf y(X„). 

n ti n n 

We conclude this § with an important theorem due to D. Ego- 
roff, concerning sequences of measurable functions (cf. D. Ego- 
roff [1], and also W. Sierpinski [3], F. Riesz [2; 3], H. Hahn 
[I, pp. 556 — 8]). We shall first prove the following lemma: 

(9.4) Lemma. If E is a measurable set of finite measure (u) and 
if {/»»(*)) is a sequence of finite measurable functions on E, con- 
verging on this set to a finite measurable function f(x), there exists, for 
each pair of positive numbers e, g, a positive integer X and a mea- 
surable subset H of E such that y(H) < g and 

*(9.5) \f"(x) — f(x)\<f 

for every n > N and every xeE — H. 
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Proof. Let us denote generally by E m the subset of E con- 
sisting of the points x for which (9.5) holds whenever n>m. Thus 
defined, the sets E m are measurable and form a monotone ascending 
sequence, since for each integer m, we have 

E„,= U E[xeE-,\f(x) — f n (x)\<e]. 

n- m-ft jc 

Further, since {/„(#)) converges to f{x) on the whole of E, we have 
E=£e„„ and so, by Theorem 9.1, y(E) = limy(E m ), i. e. 

m m 

lim n(E — E m ) — 0, and therefore, from a sufficiently large m 0 

in 

onwards, y(E — E m ) < v. We have now only to choose N = m Q 
and H=E — E„, u , and the lemma is proved-. 

(9.6) Ey or off's Theorem. If E is a measurable set of finite 
measure (y) and if { /„(*)) is a sequence of measurable functions finite 
almost everywhere on E, that converges almost everywhere on this set to a 
finite measurable function f(x), then there exists, for each e>0, 
a subset Q of E such that /i(E — Q) < e and such that the converg- 
ence of \fn(x)) to f(x) is uniform on Q. 

Proof. By removing from E, if necessary, a set of measure (y) 
zero, we may suppose that on E, the functions f n (x ) are everywhere 
finite, and converge everywhere to f(x). By the preceding lemma, 
we can associate with each integer m~> 0 a set E m C E such that 
MH m ) < f/2"' and an index N m such that 

(9.7) | f„(x) — f{x) \ < 1/2"' for n> N,„ and for xeE — H,„. 

oo 

Let us write Q — E — ^ H m . We find 

m i 

oo oo 

,< ( E-Q ) < V /i(ff m )< V t/2"' = f, 

m 1 m - l 

and since the sequence fn(x) converges uniformly to f(x) on the 
set Q on account of (9.7). the theorem is proved. 
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The theorem of Egoroff can be given another form (cf. N. Lusin [I, p. 20]), 
and, at the same time, the hypothesis concerning finite measure of E can be 
slightly relaxed. 

(9.8) If E is the sum of a sequence of measurable sets of finite measure (p) and 
if {fn(x)} is a sequence of measurable functions finite almost everywhere on this set , 
converging almost everywhere on E to a finite function , then the set E can be expressed 
as the sum of a sequence of measurable sets H t E v ... suck that p(H) — 0 and 
that the sequence { fn(x )) converges uniformly on each of the sets En . 

For the proof, it suffices to take the case in which the set E is itself of fin- 
ite measure. With this hypothesis, we can, on account of Theorem 9.0, define 

n 

by induction a sequence {Ek}k=U2,... of measurable sets such that (E — ]]Ek) ^ 1/n, 

k=i 

and that the sequence ifn(x ) } converges uniformly on the set Ek for each k. Choosing 

oo 

E — E — £ Em* we have p(H)— 0, and the theorem is proved. 

£=i 

As we may observe, the hypothesis that the set E is the sum of a sequence 
of sets of finite measure, is essential for the validity of Theorem 9.8. For this pur- 
pose, let us take as a space Jf 0 the interval [0, 1], and as an additive class 9E 0 
of sets, that of all subsets of X Q . Further, let us define a measure p 0 by writing 
/m 0 (X) = oo whenever the set X e 9f 0 is infinite and ju 0 (X) — n, if A is a finite 
set and n denotes the number of its elements. The sets of measure (<w 0 ) zero then 
coincide with the empty set. Finally, let ign(x)} be an arbitrary sequence of fun- 
ctions, continuous on the interval [0, 1], converging everywhere on this interval, 
but not uniformly on any subinterval of [0, 1]. 

To justify our remark concerning Theorem 9.8, it suffices to show that 
the interval X 0 = [0, 1] is not representable as the sum of a sequence {En} of 
sets such that the sequence of functions ! gn(x )} converges uniformly on each of 
them. But if such a decomposition were to exist, we might suppose firstly — 
since the functions gn (x) are continuous — all the sets En closed. Then, however, 
by the theorem of Baire (cf. Chap. II, § 9) one of them at least would contain a 
subinterval of [0, 1]. This gives a contradiction, since by hypothesis, the se- 
quence { gn(x )} does not converge uniformly on any interval whatsoever. 

§ 10. Integral. If we are given ra the space X an additive class 
of sets 9£ and a measure \i defined for the sets of this class, we 
can attach to them a process of integration for functions of a 
point. In fact: 

(i) If f(x) is a function (SE) non-negative on a set E , we shall 
understand by the definite integral (9E, //) of f(x) over E the up- 
per bound of the sums n 

2v k n(E b ), 

A=1 

where {!?*)*= 1,2 n is , an arbitrary finite sequence of sets (9E) such 

that E — E^ E 2 ... “f* E n and Ej • E k ~ 0 for i - j- fcj and where 

»*, for fc = 1, 2, denotes the lower bound of f(x) on E k . 
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(ii) If f(x) is an arbitrary function measurable (90 on a set 23, 
we shall say that f(x) possesses a definite integral (% p) over 23, if 
one at least of the non-negative functions f(x) and — f(x) (cf. §3) 
possesses a finite integral over E according to definition (i). And, 
if this condition is satisfied, we shall understand by the definite 
integral (% p) of the function f(x) over E the difference between 

o 

the integral of f(x) and that of — f(x) over E. The definite integral 

o 

(% p) of f(x) over E will be written (9 E) j f(x) dp (x). If this integral 

k 

is finite, the function f(x) is said to be integrable (SE, p). For every 
function f(x) possessing a definite integral over a set E, we evi- 
dently have 

mffdp = m ffdn-m [(-f)dn = m ffdp + m [tap. 

k k k 0 k k 

We see at once that the two definitions (i) and (ii) are com- 
patible, i. e. that they give the same value of the integral to any 
non-negative measurable function. Moreover: 

(10.1) If g = [v v X x ;v 2 , X 2 , X m \ is a simple non-negative func- 
tion on the set E — X x + X 2 -f- ... -f X„„ the sets X , being measur- 
able (9C), then 

I gdp = Vv,r(X,). 

H ' ' 

For, if [Ej)j I -, is an arbitrary subdivision of E into a finite 
number of sets (90 without points in common, and if Wj denotes 
the lower bound of g(x) on E n we have w t - < v, whenever 23, • X, 4= 0. 

n ti in ii m m 

Hence ^Wjp(E,) = ^ ^Wjp(Ej- X,-)< v ^v,p(E r X,) =^Vip(Xi), 

j- 1 j 1 /- 1 j 1 /'I /I 

and therefore / gdu<i^ViP(X,). The opposite inequality is ob- 

k I * * * * * 7 

’a 

vious, since the sets X„X 2 ,...,X m themselves constitute a subdivision 
of E into a finite sequence of sets (56) on which the values of g(x) 
are v v v 2 ,...,v,„ respectively. 
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§ 11. Fundamental properties of the integral. We shall 

begin with a few lemmas concerning integration of simple functions. 
As in the preceding §§, the symbols % p etc. will often be omitted. 

(11.1) Lemma. 1° For every pair of functions g(x) and h(x), simple, 
non-negative, and measurable (SE) on a set E, we have 

(11.2) f [g(x) + h(x)]dp(x) = f g(x)d,u(x)+ f h(x)dp(x). 

t ’ EE 

2° If the function f(x) is simple, non-negative, and measurable (9E) 
on the set A 4- B where A and B are sets (SE) without common points, then 

(11.3) / f(x) dp (x) = J f(x)dy(x) + / f(x)d/i(x). 

A + B A B 

Proof. As regards 1°, let 

g = {g„ g%, 0 ...» O n ) and h = ( h ffjj •••) h m , E m }, 

where E = G l +...+Q„ = #!+... +H m . 

We then have, by (10.1), 

n m 

I [g(x)+h(x)] dll ( x ) = V V ( gi +hj) (I ( Gj ■ Ej) = 

E m 1 J ~ l 

n m m n 

= Z 9 , 1 9(0, • Hj)+ v hj v [,(0, • ff,) = 

< i j = i y-i i 

== 2 gMGi)+I hjii(Hj)= j g(x)du(x) + / h(x)dfi(x). 

' 1 7 1 A’ A 

As regards 2°, if £7 = A+B and f = [f v Qii f t ,Q*i -> fn,Qn), 
where B =@ 1 +@ 2 +. ..+#,„ we have 

ff(x)du(x).= ifrl,m = ifrli(A.Q,) + ifrli(B.Q,) = 
if > I ' I ' 1 

= f f(x)du (X) + ff(x)dp(x). 

A B 

(11.4) Lemma . If {g„(x)) is a non-decreasing sequence of functions 
that are simple, non-negative, and measurable (9 E) o» o set E, and if, 
for a function h(x), simple, non-negative, and measurable, on E, 
we have lim g„(x) ^ h(x) on E, then 

lim J g„ ( x ) d,u (x) ^ J h(x) dp ( x ). 

" ft ft 


(11.5) 
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Proof. Let h = {v v E t ; v 2 , E % ; ...; v m , E m ), where 
0 v 2 < v 2 < ... < v m and E==E 1 -\-E 2 -\- ...-\-E m . 

We may suppose v 2 > 0, for, otherwise, we should have 
J hd(i = j hdfi, and, since j g n dp ^ f g n dfi, we could replace 

E E — E\ E E — Ei 

the set E by the set E — E t on which h(x) does not vanish any- 
where. Further we shall assume first that v m < + OO. 

Let us choose an arbitrary positive number f < v t , and let 
us denote, for each positive integer », by Q„ the set of the points 
a of E for which g n (x)>h(x) — f. The sets Q„ evidently form an 
ascending sequence converging to E, and, by Theorem 9.1, we have 
p(Q n ) ->n(E). This being so we have two cases to distinguish: 

(i) r{E) =# oo. We then can find an integer n 0 such that for 
»>n 0 we have fi(E — Q„) < f, and therefore, by Lemma 11.1, 

/ g n dfi > f g n dfi > f [h(x) — e]d/i (x) = 

& Qn Q/i 

= J hdfi—e/i(Q n j hd/i — v m -p(E — Q„) — ep(Q n )^ / hdfi — [v m +n(E)]-e-, 
Q„ E k 

and, passing to the limit, making first n -> oc, and then e -> 0, 
we obtain the inequality (11.5). 

(ii) p(E)= oo. Then, since f g„dfi^(v 1 — e)fi(Q„), we obtain 

E 

lim [g n df4= oo, so that the inequality (11.5) is evidently satisfied. 

n E 

Suppose now v„, = -f oc. Then by (10.1) and by what has 

m l 

already been proved, lim g n dti^v-p(E m )+ £ v r p(E,) for any 

n j£ /=] 

finite number v, and consequently for v — -f oo = v,„ also; whence, 
in virtue of (10.1) the inequality (11.5) follows at once. 

(11.6) Lemma. If the functions of a non-deereasing sequence 
{g„(x)} are simple, non-negative, and measurable (9 t) on a set E, and if 

g(x) = lim g„(x), then lim I g,,{%) d,u (x) — j g(x) d p (x). 

n n g ft 

Proof. Let E v E 2 , ..., E„, be an arbitrary subdivision of E 
into a finite number of measurable sets, and let v v v 2 , ..., v,„ be 
the lower bounds of g(x) on these sets respectively. Let us write 
v={v v EtfVg, E 2 , E m ). We evidently have lim g„ (x)=g(x)&sv(x) 

n 

on E, and hence, by Lemma 11.4 and by Theorem 10.1 
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lim j g„ dfi ^ f v dfi = ]£v t n (E t ). 
n k e ^ 

It follows that lim f g n du ^ / g d.w, and since the opposite 

" k e 

inequality is obvious, the proof is complete. 

We are now in a position to generalize Lemma 11.1 as follows: 

(11.7) Theorem . The relation (11.2) holds for every pair of functions , 
g(x) and h(x ), non-negative and measurable (9E) on the set E, and the 
relation (11.3) holds for every function f(x) non-negative and measur- 
able (SE) on the set i + 5, where A and B are sets (9E) without 
points in common . 

Proof. By Theorem 7.4 there exist two non-decreasing se- 
quences {< 7 „(#)) and {h n (x)} of simple non-negative functions mea- 
surable (9E) on E, such that g(x) = ]img n (x) and h(x) — lim h n {x). 

n n 

Now, by Lemma 11.1 (1°), we have f (g n + h„)dp= f g n dp + f h n du 

k k k 

and hence, making n -> oo, we obtain, on account of Lemma 11.6, 
the relation (11.2). Similarly, if we approximate to f(x) on A + B 
by a non -decreasing sequence of simple non-negative functions and 
make use of Lemma 11.1 (2°). we obtain the relation (11.3). 

(11.8) Theorem . 1° For any function measurable (9E), the integral over 
a set of measure zero is equal to zero . 2° If the functions g(x) and 
h(x) measurable on a set E are almost everywhere equal on E , and 
if one of the two is integrable on E , so is the other , and their integrals 
over E have the same value. 3° If a function f(x) measurable (9E) 
on a set E has an integral over E different from + oo, the set of the 
points x of E at which f(x) = -f °° has measure zero. In particular , 
if the integral of f(x) over E is finite , the function f(x) is finite 
almost everywhere on E. 

Proof. We obtain at once part 1° of this theorem by making 
successive use of the definitions (i) and (ii) of § 10. 

As regards 2°, it is evidently sufficient to consider the case 
of non-negative functions g(x) and h(x). If we denote by E l the 
set of the points x of E at which g(x) ^ h(x ), we ha\ e by hypothesis 
^(JB7 1 ) = 0, and, on account of (1°) and of Theorem 11.7, we obtain 

j gdp = j gdp = I hdfi = J hdy, as required. 

E E ~E X E-E x E 
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Finally, as regards 3°, let us suppose that for a function f(x) 
measurable (SE) on E we have f(x) =■ 4- oo on a set E 0 (Z E of positive 

measure. We then have / f dp ^ j f dp^n p (E 0 ) for every », and 

k k„ 

so I f du = - f oo. Consequently, the integral of f(x) over E, if it 

k 

exists, is positively infinite, and this completes the proof. 

We now generalize Lemma 11.1 (1°) and also complete The- 
orem 11.7, as follows: 

(11.9) Theorem of distributivity of the integral. Every linear 
combination with constant- coefficients, a-g(x) + b-h(x) of two func- 
tions g(x) and h(x), integrable (3E, p) over a set E, is also integrable 
over E, and we have 

(11.10) ) (ag -\-bh)dn = a f gdp + b j hdp. 

k k k 

Proof. By Theorem 11.8 (3°), the set of the points at which 
either of the functions g(x) and h(x) is infinite, has measure zero, 
and if we replace on this set the values of both functions by 0, we 
shall not affect the values of the integrals appearing in the relation 

(11.10) . We may therefore suppose that the given functions g and h 
are finite oh E. Further, the relations 

J agdp = a J gdj>, J bhdp —b j hdu 

B B E F. 

being obvious, we need only prove the formula (11.10) for the 
case a=b= 1. Finally, the set E can be decomposed into four 
sets on each of which the two functions g(x) and h(x) are of cons- 
tant sign. So that, on account of Theorem 11.7, we may assume 
that the functions g(x) and h{x) are of constant sign on the whole 
set E. Now, by the same theorem, the relation 

(11.11) f (g -\-h)du = j gdp + f hdn 

B BE 

holds whenever the functions g and h are both non-negative or 
both non-positive on E, and it only remains, therefore, to show 
that this relation is valid when g and h have, on E, opposite signs, 
the one, g(x) say, being non-negative, the other, h(x), non-positive. 
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This being so, let E l and E % be the sets consisting of the points 
x of E for which we have g(x)+h(x) ^ 0 and g (sc)-f h (x) <0, 
respectively. The functions g, g+h, and — h are non-negative on E l 
and we therefore have, by Theorem 11.7, 

/ g dii = I (g + h)dp + I ( — h) d/i = j ( g+h ) dp — / h do. 

k, k, k, k, k, 

Similarly 

— f h dp = I ( — h) dp = j (— g—h ) dp+ j gdp = — / (g+h) dfi -f / gdp. 

k. k, k, k . k. k. 

Therefore, for * = 1, 2, we have / (g+h) du — / gdfi + f hdfi, and 

h t h , hj 

by Theorem 11.7 we obtain the relation (11.11). 


(11.12) Theorem on absolute intcyrability. 1° In order that 
a function f(x) measurable (9E) on a set E should be integrable (9£, ft) 
on E , it is necessary and sufficient that its absolute value should be 
so. 2° If, for a function g(x) measurable (9E) on a set E, ti. e exists a 
function h(x), integrable (T, p) and such that \g.(x)\ ^h(x) on E, 
then the function g(x) also is integrable on 22; in particular, every 
function measurable (SE) and bounded on a set E of finite measure (p) 
is integrable (% y) on E. 

Proof. As regards 1°, we have by Theorem 11.7 

l\f\dp=[fdp+[(-f)dp, 

k k k 

and integrabibty of |/| is therefore equivalent to that of / and that 
of — / holding together, i. e. to integrability of /. 

•o 

As regards 2°, we have the inequalities g(x)^\g(x)\^h(x) and 
— |ff(a?)l < h(x) on E, and, since h(x) is, by hypothesis, inte- 

grable on E , it follows that the same is true of the non-negative func- 
tions g and — q, and therefore of the function g(x). 
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As an immediate consequence of Theorem 11.12 we have the 
following theorem, known as the 

(11.13) First Mean Value Theorem . Given , on a set 2J, a 
function f(x) bounded and measurable (9E) on E and a function g(x) 
integrable (9E, (i) on E , the function f(x) • g(x) is integrable on E 
and there exists a number y lying between the bounds of f(x) on 
E , such that 

(11.14) / \f(x)\g(x)\dn(x) —y-f \g(x)\dfi(x). 

k k 

Proof. If we denote by m and M respectively the lower and 
the upper bound of f(x) on E, and make use of Theorem 11.12, 
we verify successively, that the functions (\M\-{-\m\)-\g(x)\, \f(x)g(x)\, 
f(x) \g(x)\ and f(x) g(x) are integrable on E. Further, we have 
m\g(x )\ ^ f( x )-\g(x )\ ^ M\g(x)\ over E, and, therefore also, 

m I \g\du^ I f-\g\du^M f\g\dg, and so choosing y=\ / f-\g\dfi\:\ ( \g\d,u] 

E E E EE 

(or, if the denominator vanishes, an arbitrary y between m and M)> 
we obtain the formula (11.14) with m^y^M. 

§ 12. Integration of sequences of functions. In this §, 
we shall establish some theorems on term by term integration of 
sequences and series of functions. 

(12.1) Theorem . If the functions of a sequence { g n (x )} are finite and 
integrable (9E, //) on a set E of finite measure , and the sequence 
converges uniformly on E to a function g(x ), then the function g(x) 
also is integrable over E , and we have 

(12.2) lim lg„(x)dn(x) = \g(x)dn{x). 

" k k 

Proof. By Theorem 8.5, the function g(x) is measurable (9t) 
on E. The functions g(x ) — g„(x) are therefore all measurable also, 
and, further, since the sequence !<p,(a;)) converges uniformly to g(x) 
the functions g(x) — g„(x) are all bounded, at any rate from some 
value of the index n onwards. These functions are thus, by The- 
orem 11.12 (2°), integrable on E, and it follows, by Theorem 11.9, 
that the function g(x) — [g{x) — g„(x)} -f g„(x) is integrable too. 
Finally, denoting by e„ the upper bound of \g(x) — <M*)| on E, 
we have 
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I / gleftdu {x,^~ f g„(x)dfi(x)\^ f \g(x)—g n (x)\d/i(x)^t n ii(E), 

k * 7 /. k 

and this establishes the relation (12.2) since, by hypothesis, £„->0 
and ft (E) 4= oo. 

Neither the theorem thus established, nor its proof, contains, 
at bottom, anything new, as compared with the similar result for 
the classical processes of integration of Cauchy, or of Riemann. 
Vt\> now pass on to th<k> proof of theorems more closely related to 
Lebesgue integration. Among these theorems, a fundamental part 
is played by the following one, which is due to Lebesgue: 

CO 

(12.3) Theorem. Let f(x) — f t ,(x) be a series of non-negative functions 

n i 

measurable ( s 3) on a set E. Then * 

OG 

(12.1) / f(x)dn(x) = V / f„(x)d;u(x). 

t: " 1 k 

Proof. From Theorem 11 7, we derive in the first place, that 

.in m 

I f du = ^ I f„dtt for every m, and so 

k k " 1 " 1 k 

oo 

(12.5) / 

A. " 1 A 

To establish the opposite inequality, let us attach, in accordance 
with Theorem 7.1, to each function f n (x) a non-decreasing sequence 
{g ( n\x))ti i. 2 . of simple functions measurable and non-negative on E, 
in such a manner that lim g K *\x) = f„(x) for »=1, 2,... Let us write 

k 

k 

s k (x) = ^ g\ k) (< r ')- The functions s k (x) are clearly simple, measurable, 

/ i 

and non-negative, on E, and they form a non-decreasing sequence. 

rn 

Further, for each m, and for k ^ m, we have ^ g\ (x) «*(«) f(x). 

I 1 
III 

Making k >oo, we derive f,(x) ^ lim s/,(x) ^/(x) for every m, 

it A 

and so, /(r) — liin sj,(x). Therefore, by Lemmas 11.6 and 11.1 (1°), 

A 

. k ■ oo 

/ f (lit = lim / .s-fr = lim / g\ k) d/i ^ / /, d//, 

A * A * ~ 1 A '* 1 A 

and this, combined with (12.5), gives the equality (12.4). 



28 


CHAPTER I. The integral in an abatraet space. 


Theorem 12.3 may also be stated in the following form: 

(12.6) Lebesgue’s Theorem on integration of monotone se- 
quences of functions. If \f„(x)} is a non-deereasing sequence of 
non-negative functions measurable (X) on a set E, and /(a?) = lim/,, (as), 

then I f(x) du (as) = lim j f„(x) du (as). 
k " k 

Proof. If we write g„{x) = f, H \ — /„(as), we obtain 
/(«) = A (as) + 1 g„(x), 

n i 

and the functions g„(x) will be non- negative and measurable on E, 
so that by Theorem 12.3 

/ fdu = / /l du + v I g n dii = lim / [/j f v (Ini dll = lim / f k d,u. 

/.; f. n 1 f. fi f, n 1 k y 

Q. E. D. 

(12.7) Theorem of additivity for the integral. If {E,,} is a 
sequence of sets measurable (X) no two of which have common points, 
and E =2.’ E„, then 

n 

(12.8) f f dfi = v j f du 

K " 

for every function f(x) possessing a definite integral ( finite or infinite) 
over E. 

Proof. It is clearly sufficient to prove (12.8) in the case of 
a function /(as) non-negative on E. Supposing this to be the case, 
let us write /„(as) = /(as) for xeE„, and /„(as) = 0 for x <- E — E„. We 
then have f(x)—^f„(x) on E, and, the functions /„ being measurable 

n 

and non-negative, we may apply Lebosgue’s Theorem 12.3. This 
gives, by Theorem 11.7, 

ff do = V / /„ d/i - V / f n dii -V /'/«*,,. Q. E. D. 

« y " H " h 

/. I. r. n r., t 

If a function /(as) has a definite integral (X, /<) over a set E, 
then fix) also has a definite integral over any subset of E mea- 
surable (X). We may therefore associate with it the function of 
a set (36) defined as follows: 

(12.9) F(X) — J f(x) il/i (x) where XC E and X tX. 
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The latter will be called the indefinite integral (SE, y) of f(x) on E. 
It follows from Theorem 12.7 that, whenever the function f(x) 
is integrable (9 l, ft) on E, its indefinite integral is an additive func- 
tion of set (SE) on E. 


We end this § with two simple but important theorems. The 
first is known as Fatou's lemma, and appears for the first time 
(in a slightly less general form) in the classical memoir of P. Patou 
[1, p. 375] on trigonometric series. The second is due to Lebesgue 
[5, in particular p. 375], and is called the theorem on term, by term 
integration of sequences of functions-, cf. also Ch. J. de la Valine 
Poussin [1, p. 445—453], R. L. Jeffery [1] and T. H. Hilde- 
brand t [2]. 

(12.10) Theorem (Fatou's Lemma). If {f,,(x)) is any sequence 
of non-negative functions measurable (SE) on a set E, we have 

j lim inf f„(x) d i (x) lim inf f f„(x) dtt(x). 

k " " k 


Proof. Lotus write g t (x) — lower bound [//(#), f-,+\ (x),f, + 2 (®),...] 
where i— 1 , 2 , .... Thus defined [g,(x)) is a non-decreasing sequence 
of non-negative functions measurable on E, and converges on the 
set E to liminf /,(a?). We therefore have, by Lebesgue’s Theorem 12.6, 

i 

I lim inf f t (x) du(x) = lim f g,(x) du (x) lim inf f f,(x) dy(x). 


(12.11) Lebesgue’s Theorem on term by term integration. 
Let ( f„(x )) be a sequence of functions measurable (9E) on a set E, 
fulfilling, for a function s(x) integrable (SE) on E, the inequality 
\f„(x)\ 8(x) for n— 1, 2, ... Then 


( 12 . 12 ) 


lim inf f f„ da ^ f lim inf /„ dg, 

" e i " 

lim sup J f„ d/t^ I lim sup /„ du. 

" . e k " 


If, further , the sequence (f„) converges on E to a function f, the sequence 
is integrable term by term, i. e. toe have 

lim f f„ dp = f f dy. 

" it if 


( 12 . 13 ) 
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Proof. Let g(x) = lim inf f n {x) and let h(x) = lim sup f„(x). 

n n 

We may clearly suppose s(x) < + oo throughout E. We then derive 
from Fatou’s Lemma 12.10, lim inf J («4-/„)<fy*^ J (s + g)dft and 

E E 

lim inf J (8—f„)dft^ f (s — h)dft, which gives at once the rela- 

" B E 

tions (12.12). 

Further, if lim f„(x)=f(x), we derive from (12.12) the relation 

n 

lim inf J f„d(i > j fdft ^ lim sup J f„d(t which gives the equality 

n E E n E 

<12.13). 


§ 18 . Absolutely continuous additive functions of a set# 

The fact that the indefinite integral of a function integrable (% (i) 
on a set E is, on E, an additive function of a set (9E), raises the 
problem of characterizing directly the additive functions expres- 
sible as indefinite integrals. 


If we restrict ourselves to the Lebesgue integral of functions of a real var- 
iable, we may regard indefinite integrals as functions of an interval, or, what 
comes to the same thing, as functions of a real variable. In that case, a neces- 
sary and sufficient condition for a function to be expressible as the indefinite 
integral of a real function was given, in 1904, still by Lebesgue [I, p. 129, foot- 
note]. A little later (in 1905), G. Vi tali [1] explicitly distinguished the class 
of functions possessing the Lebesgue property by introducing the name of “ab- 
solutely continuous functions’*. 

The condition of Lebesgue and Vitali was later extended to functions of 
a set by J. Radon [1] (cf. also P. J. Daniell [2]). But Radon considered only 
additive functions of sets measurable in the Borel sense in Euclidean spaces, 
and only measures determined by additive functions of intervals (cf. below Chap- 
ter III). The final form of the condition of Lebesgue- Vitali, as given in 
Theorem 14.11 below, is due to 0. Nikodym [2]. 


An additive function of a set (SE) on a set E, will be said to 
be absolutely continuous (9E, ft) on E, if the function vanishes 
for every subset (9 E) of E whose measure (ft) is zero. An additive 
function ®(X) of a set (90) on a set E will be termed singular (90, ft) 
on E, if there exists a subset E 0 C. E measurable (90), of measure (ft) 
zero, such that <P(X) vanishes identically on E — E a , i. e. 
0(X)= 0(E o -X) for every , subset X of E measurable (90). The 
following statements are at once obvious: 
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(13.1) Theorem . 1° In order that an additive function of a set (9E) on a 
set E should be absolutely continuous (9E, y) [or should be singular ] 
it is necessary and sufficient that its two variations, the upper and 
the lower , should both be so. 2° Every linear combination, with con- 
stant coefficients, of two additive functions absolutely continuous 
[or singular ] on a set E is itself absolutely continuous [or singular ] 
on E. 3° If a sequence {Q n (X)) of additive functions, absolutely 
continuous [ singular ] on a set E, converges to an additive function 
<P(X) for each measurable subset X of E, then the function $(X) is 
also absolutely continuous [singular]. 4° If a function of a set (9E) 
is additive and absolutely continuous [singular] on a set E, the 
function is so on every measurable subset of E. 5° If E~^E n , 

n 

where {E,,} is a sequence of sets (9E), and if an additive function $(X) 
on E is absolutely continuous [singular] on each of the sets E n , the 
function is absolutely continuous [singular] on the whole set 25. 
6° An additive function of a set cannot be both absolutely conti- 
nuous and singular on a set E , without vanishing identically on E. 

For sets of finite measure, it is sometimes convenient to apply 
the following test for absolute continuity: 

(13.2) Theorem . In order that a function ®(X) additive on a set E of 
finite measure , be absolutely continuous (9E, //) on E, it is necessary 
and sufficient that to each £>0 there correspond an f?>0, such that 
y(X)<y imply \<D(X)\<e for every set 1C E measurable (9E). 

Proof. It is evident that the condition is sufficient. To prove 
it also necessary, let us suppose the function ®(X) absolutely con- 
tinuous in E. We may assume, replacing if necessary, $(X) by 
its absolute variation, that $(X) is a non-negative monotone func- 
tion on E. This being so, let us suppose, if possible, that there 
exists a sequence {E n ) n of measurable subsets of E , such that 
/* (£?„)< 1/2" and that ^(E„)>tj 0 , where % is a fixed positive number. 
Let us write E 0 = lim sup E, t . For every n, we then have 

/i 

l*(E 0 ) ^ £ p(Ek) ^ 1/2" and therefore i*(E 0 )=Q. On the other 

k n 

hand, by Theorem 5.1, we have ^(E 0 ) ^ lim sup *]<>■ This 

/I 

is a contradiction, since $(X) is absolutely continuous, and the 
proof is complete. 
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(13.3) Theorem. In order that a function 0(X), additive on a set 
E, be singular (9E,/i) on E, it is necessary and sufficient that for 
each e>0 there exist a set 1C E measurable (9 E) and fulfilling the 
two conditions fi(X)<e, W(<P;P — X) < e. 

Proof. The condition is clearly necessary. To prove it suf- 
ficient, let us suppose that for each n there is a set X„QE meas- 
urable (9 E) such that .«(Z„)<l/2" and W($; E — X n ) < 1/2", and 

oo 

let us write E 0 = lim sup X„. We then have /<(E 0 ) ^ £ (i(X k ) ^ 1 /2' 1 ' 

/i k^n 

for each n, and so fi(E 0 ) = 0. On the other hand, by Theorem 5.1 
we have W( 0; E — E 0 )<i lim inf W(<5; E — X„) — 0. The function 

n 

0(X) is therefore singular on E. 

K 14. The Lebesgue decomposition of an additive 
function. Before proving the result announced in the preceding §, 
we shall establish some auxiliary theorems. We begin with the follow- 
ing theorem due to H. Hahn [I, p.404] (cf. also W. Sierpihski [11]): 

(14.1) Theorem. If @( X) is an additive function of a set (9£) on a 
set E, there exists always a set PC_E measurable (9E), such that 
W((P;P) = 0 = W (0-, E — P), or, what comes to the same thing, 
such that 0(X)^O for every measurable set X C P 0{X) ^0 
for every measurable set X C E — P. 

Proof. For each positive integer n, we choose a set E„ such 
that 0(En)^ W(0; E) — 1/2". By Theorem 6.4 we then have, 

(14.2) W(0;F„)> — 1/2" and W(0-,E — E„)^l/2". 

OO 

Writing P= lim inf E„ , we see that E — P = lim sup ( E — E n )(Z2(E — E„) 

n it u~n) 

for every m, and therefore, by (14.2), 

W ( 0-, E — P)< I W ( 0; E - E u ) ^ 

n - tn " 

which gives W(0;E — P) = 0. On the other hand, the lower var- 
iation W(<P; X) is a non-positive monotone function of a measurable 
set XQE, and, by Theorem 5.1 and the first inequality (14.2), 
we must have the relation |W( 0-, P)| ^ lim inf |W( 0\ E„) = 0, which 

n 

gives W (0;P) = 0 and completes the proof. 
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(14.3) Lemma. If (P(X) is a non-negative additive function of a 
set (9 E) on a set E, there exists, for each a > 0, a decomposition 
of E into a sequence of measurable sets without common points , 
H, E v E t , ..., E„, ... such that p(H) = 0 and that 

(14.4) a • (n — 1) • ,u(X) ^ 0(X) <o« • p(X) 

for every set X C E n measurable (9E). 

Proof. By Theorem 14.1, there exists, for each positive in- 
teger n, a measurable set A n such that <P(X) — an ■ p(X) ^ 0 "for 
every measurable set XC An and <#(X) — an ■ p(X)^0 for every 

co 

measurable set XQE — A„. Write B„ = Y A*. Any measurable 

k=~n 

oo 

subset X of B„ may be represented in the form X — £ X k , where 
X* are measurable sets, X*C for k = n, n+1, ..., and X,Xy=0 

oo oo 

fori^?; and 80 ^{X)=^ ^(Xk)~^)ialc • p(Xk)^an-p{X). We obtain 

k n k n 

thus a descending sequence of measurable sets { B„ } such that 

0(X)^an.fi(X) if XC B„, Xt% 

( ' J) 0(X)^an-p(X) if XC B — B„, Xe% 

the second relation being obvious, since E — B„ is a subset of 
E — An. 

Let us now write E X =E — B v E„=B n i — B„ for n— 2,3,..., 
and H=limB„. Thus defined the sets H, E v E 2 , ..., E,„ ... are mea- 

n 

oo 

surable and without common points, and E—H+J^E,,. Taking 

n i 

into account the relations (14.5), we see at once that the inequality 

(14.4) holds whenever X is a measurable subset of E n . Finally, H(ZB n 
for each positive integer n, and therefore, by the first of the rela- 
tions (14.5), we get $(H)^an • y{H), which requires n(H) = () and 
completes the proof. 

(14.6) Theorem . If E is a set ( S 3E) of finite measure (p), or, 
more generally , a set expressible as the sum of a sequence of sets (9E) 
of finite measure, every additive function of a measurable set $(X) 
on E is expressible as the sum of an absolutely continuous additive 
function W(X) and a singular additive function @(X) on E . Such a de~ 
composition of $(X) on E is unique , and the function W(X) is, on E, 
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the indefinite integral of a function integrable (9E, //) on E. If &>(X) 
is a non-negative monotone function on E y so are the corresponding 
functions *P(X) and ®(X). 

Proof. Since every additive function of a set is the difference 
of two non-negative functions of the same kind (cf. § 6), we may 
restrict ourselves to the case of a non-negative ${X). Further, we 
shall assume to begin with that the set E has finite measure. By the 
preceding lemma, there exists, for every positive integer m, a de- 
composition of E into a sequence of measurable sets H\ m \ E\ m \ Ei m \ ..., 
without common points and subject to the conditions: 

(14.7) E = H im) + E\ m) + ... + £ ( ( , m) + ... , fi(H {m) ) = 0, 

(14.8) 2“ m .(n — l).//(XX<P(X)<2 _, ".n-//(X), if ZC®!."'\ 

We therefore have, for all positive integers m, n , and k , 

2~ m ■n-ti(E ( ™ ) -ET +X) )> 0(E^ n) -ET +X) )>X m ^' • (fc— 1) • //(^ < „'" ) -/'4'" 
and 

2 m ~‘ k-n(&n ) -ET* l) )> 4>(E ( n m) ■ E ( r +X) )>2~ m ■ (»— l)-/»(BJr ) -A 1 i mH> ), 

from which it follows that (2 n — k f 1 ) // ( E ( ,T ) ■ btf" 0, and that 
( lc—2 n + 2) // (E ( ™ ] ■ Et' + ' ) )> 0. Hence // ( E\' n) • A’*" "’) =0 whenever 
either k>2n+l , or k<2n — 2. 

We may therefore write 

(14.9) E ( n m) c Etty+Ei';: , } ) +E , ;::+ ,) ~t M-f.” -i wq»'" , )#o. 

This being ro, let H= V H {m) + V £<"'>. We write /<"'>(*) - 2 — L) 

m~l 

for xeEl ',"' — //, »=1,2,.„, and /<"'>(a;) = 0 for atf//. We thus obtain 
a sequence {/‘"'’(x)) of non-negative functions measurable (9f) on the 
set E. By (14.9) we have clearly |/ ( " ,M> (.'r) — ‘2 on E, so 
that the sequence { f ni) (x )} converges uniformly on E to a non-neg- 
ative measurable function f(x). 

The set H being of measure zero, we have, by (14.7) and (14.8), 
for every measurable set XQ E and for every positive integer in, 

0(X) ^ 0(X-H) + Z 2 • (»— 1) • ft (X • £<'">) = 0(X ■ H) + f / ( '">dj«, 

and 

<P(XX<P(X.//)+2 , 2 E<"‘>) ^0(X H) [ f f-2 ""-//(X). 

II J 
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Hence, making m->oo, we derive $(X)= ff(X)du(x)+<D( X-H). 

x 

This decomposition, so far established subject to the hypo- 
thesis that the set E is of finite measure, extends at once to sets 
expressible as the sum of an enumerable infinity of sets of finite 

oo 

measure. In fact, if E=i%A„, where the A„ are sets (SE) without 

n = 1 

common points and of finite measure, then, by what we have al- 
ready proved, there exists on a non-negative function f„(x) in- 
tegrate on A n , and a measurable set H„(ZA„ having measure 

zero, such that d>(X-A n )=ff„dfi-\-0(X-H n ), for n— 1 , 2 ,... If we 

X-A„ 

now write H=]?H n , and f(x)=f„(x) for xfA n , we obtain a measur- 

n 

able set H(ZE of measure zero, and a function f(X), non-negative 
and integrate on E, such that, by Theorem 12.7, for every meas- 
urable set X(Z E 


(14.10) 0(X)=i; / fd f i + Z$(X-H n )= [ fdn + 0(X-H). 


X-A„ 


Now, the indefinite integral vanishes for every set of measure zero, 
and therefore is an absolutely continuous function; on the other 
hand, we have 4>(X-H )= 0 for every measurable set XQE — H. 
Thus, since the set H has measure zero, formula (14.10) provides 
a decomposition of <P(X) into an absolutely continuous function 
and a singular function. Finally, to establish the unicity of such 
a decomposition, suppose that ^(X) = I P 1 (X) + S 1 (X)= I P 2 (X)+S 2 (X) 
on E, the functions S^X) and W 2 (X) being absolutely continuous, and 
the functions ^(X) and ® 2 (X) being singular. Then ^(X) — !P 2 (X)= 
= @ 2 (X) — fy(X) identically on E, whence by Theorem 13.1 (2° 
and 6°), we have (P 1 (X)=W 2 (X) and & 1 (X)—& i (X), and this com- 
pletes the proof of our theorem. 


The expression of an additive function as the sum of an ab- 
solutely continuous function and of a singular function will be termed 
the Lebesgue decomposition. The singular function that appears in 
it is often called the function of the singularities of the given function. 
From Theorem 14.6, we derive at once 
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(14.11) Theorem of Madon-Nikodym . If E is a set of finite 
measure , or, more generally the sum of a sequence of sets of finite mea- 
sure (y), then , in order that an additive function of a set (9E) on E be 
absolutely continuous on E y it is necessary and sufficient that this 
function of a set be the indefinite integral of some integrable function 
of a point on E. 

The hypothesis that the set E is the sum of an at most enumerable infin- 
ity of sets of finite measure, plays an essential part in the assertion of the theorem 
of Radon -Nikodym, just as in Theorem 9.8. To see this, let us take again the 
interval [0, 1] as our space X v and let the class 9E X of all subsets of [0, 1] that 
are measurable in the Lebesgue sense (cf. below Chap. Ill) be our fixed additive 
class of sets in the space X x . A measure /q will be defined by taking ^ x (X)— oo 
for infinite sets and < M 1 (JT)=n for finite sets with n elements. This being so, the 
sets (SEj) of measure (//j) zero coincide with the empty set, and therefore, every 
additive function of a set (9^) on X x is absolutely continuous ($ lt f4 x ). In particular,, 
denoting by A(X) the Lebesgue measure for every set X e 9E lf we see that A(X) 
is absolutely continuous (9t lf on X v We shall show that A(X) is not an inde- 
finite integral ($,, /r,) on X x , Suppose indeed, if possible, that 

A (■ = f 9 ^ d,<i ^ 

X 

for every set X e 9£ 2 , the function g(x) being integrable (X v t u x ) on X v Since 
A(X) is non-negative, we may suppose that g(x) is so too. Let E~E[g(x) s 0] 

X 

and En~^[g(x) 1/h] for n— 1,2,... We have AfA^ — E)-?Jgdp l ~ 0, so that 

xf-E 

yl(£') = y1(A 1 ) = l and this requires the set E to be non -enumerable. Since E~2En, 

n 

the same must be true of En« for some positive integer w 0 . Thus 
A (E'J = f g(*)d/'i(x) & Hi (U„ n )/n 0 =oo, 

which is evidently a contradiction. 


§ 15. Change of measure. Any non-negative additive func- 
tion v(X) of a set (9£) may clearly be regarded as a measure cor- 
responding to the given additive class 90 When such a function 
v(X) is defined only on a set E, we can always continue it (cf. § 5, p.9) 
on to the whole space. The terms measure (v), integral (SE,*”) 
etc. are then completely determined for all sets (90, but, in this 
case, it is most natural to consider only the subsets of E for which 
the function r{X) was originally given. 
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(15.1) Theorem. Whenever, on a set E measurable (9E), we have 

(15.2) v(X) = (90 / g(x)d,u(x) + $(X) 

X 

where 9(X) is a non-negative function , additive and singular (9 t,p) 
ow E, and where g(x) is a non-negative function integrable (96, ju) 
oyer 2?, Men aiso 

(15.3) (9E) f j(x)dv(x) = (9t) f f(x)g(x)du(x) + (90 f f(x)d9(x) 

XX X 

for every set XCE measurable (96) and for every function f(x) that possesses 
a definite integral (96, i>) overX. If , further , the function f(x) is integrable 
(%v) over Ej the formula (15.3) expresses the Lebesgue decomposition 

of the indefinite integral f f dr on E , corresponding to the measure p, 

x 

the function 0(X) = f f d& being the function of singularities (96, /*) 
x 

of the indefinite integral f f dr. 

x 

Proof. We may clearly assume that f(x) is defined and non- 
negative on the whole of the set E. We see at once that, for each 

set YC.E measurable (96), / c Y (x)dv(x) = v(Y)= f g(x)du(x) + d(Y)-= 

Y Y 

= J c Y (x)g(x)dp(x) + I c Y (x)dd(x), and hence also that for every 

Y Y 

finite function h(x) simple and measurable (96) on a set IC®? 

(15.4) j h(x)dv(x) = j h(x) g(x) du(x) -f f h(x)d&(x). 

XX X 

Let now {A„(a?)} be a non-decreasing sequence of finite simple 
functions measurable (96) and non-negative on X , converging to the 
given function f(x). Substituting h n (x) for h(x) in (15.4) and making 
n->c o, we obtain (15.3), on account of Lebesgue’s Theorem 12.6. 
If, further, f(x) is integrable (96, v) over E, the identity just es- 
tablished shows at once that the product f(x) g(x) is integrable (96, p) 

over E and hence, that the indefinite integral / fgdp is absolutely 

x 

continuous (96, p) 'on E. On the other hand, the function 0(X) 
vanishes on every set on which the function &(X) vanishes, and 
therefore, is singular (96, p) on E together with #(X). This completes 
the proof. 



38 


CHAPTER I. The integral in an abstract space. 


The wide scope of Theorem 15.1 is due to the fact that, if 
/<(Z) and v(X) are any two measures associated with the same class 
9 1 of measurable sets and we have at the same time n(E) < + oo, 
and v(E) < + oo, for a set Ee % then the measure v can be repre- 
sented on E in the form (15.2), where g(x) is a function integrable 
(%fi) over the set E and #(X) is a non-negative function, additive 
and singular (% u) on the same set (cf. Th. 14.6). Hence, with the 

above hypotheses and notation, in order that ffdv = jfgdp 

x x 

should hold identically on E, it is necessary and sufficient that 

the indefinite integral J fdv be absolutely continuous {%n) on E. 

x 

This condition is clearly satisfied whenever the measure v(X) is 
itself absolutely continuous (% ft). 



CHAPTER II. 


Carath6odory measure. 

§ 1. Preliminary remarks. In the preceding chapter, we 
supposed given a priori a certain class of sets, together with a meas- 
ure defined for the sets of this class. A different procedure is usually 
adopted in theories dealing with special measures. We then begin 
by determining, as an outer measure, a non-negative function 
of a set, defined for all sets of the space considered, and it is only 
a posteriori that we determine a class of measurable sets for which 
the given outer measure is additive. 

An abstract form of these theories, possessing both beauty 
and generality, is due to C. Carath6odory [I]. The account that 
we give of it in this chapter, is based on that of H. Hahn [I, Chap. VI], 
in which the results of Carath^odory are formulated for arb- 
itrary metrical spaces. This account will be preceded by two §§ 
describing the notions that are fundamental in general metrical 
spaces. 


§ 2. Metrical space. A space M is metrical if to each pair 
a and b of its points there corresponds a non-negative number Q(a, b ), 
called distance of the points a and b, that satisfies the following 
conditions: (i) Q(a, £>)=0 is equivalent to a=b, (ii) Q(a, b)=g(b, a), 
(iii) Q(a, b) g(b, c)~^ g(a, c). In this chapter, we shall suppose 
that a metrical space M is fixed, and that all sets of points that 
arise, are located in M. 

The notation that we shall use, is as follows. A point a is limit 
of a sequence {a,,} of points in M, and we write a = lim a „ , if 

It 

lim (/(a, a„)=0. Every sequence possessing a limit point is said to 
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be convergent. Given a set M, the upper bound of the numbers Q(a, b) 
subject to aeM and beM is called diameter of M and is denoted 
by d(M). The set M is bounded if S(M) is finite. For a class SOI 
of sets, the upper bound of the numbers d(M) subject to ilftSO? 
is denoted by d(S01) and called characteristic number of SOI. 
By the distance p(a, A) of a point a and a set A, we mean the lower 
bound of the numbers p(a, x) subject to xe A, and by the distance 
q(A,B) of two sets A and B, the lower bound of the numbers q(x, y) 
for xeA and y e B. 

We call neighbourhood of a point a with radius r > 0, or open 
sphere S(a; r) of centre a and radius r , the set of all points x such 
that Q(a,x)<r. The set of all points x such that p(a,x)^r is 
called closed sphere of centre a and radius r, and is denoted by S (a; r). 

A point a is termed point of accumxdation of a set A, if every 
neighbourhood of a contains infinitely many points of A. The set 
A' of all points of accumulation of A is termed derived set of A. 
The set A + A', that we denote by A, is termed closure of A. If A — A» 
the set A is said to be closed. The points of a set, other than its 
points of accumulation, are termed isolated. A set is isolated, if all 
its points are isolated. We call perfect, any closed set not containing 
isolated points. 

A point a of a set A is said to be an internal point of A , if there 
exists a neighbourhood of a contained in A. The set of all the internal 
points of a set A is called interior of A and denoted by A". The set 
A — A is termed boundary of A. If A=--A°, the set A is said 
to be open. Two sets A and B are called non-overlapping, if 
AB B =BA°—0. 

The class of all open sets will be denoted by © and that of 
all closed sets by 5 . 1° accordance with the convention adopted 
in § 2 of Chap. I, p. 5, open and closed sets will also be termed 
sets (©) and sets (5) respectively. We see at once that the com- 
plement of any set (©) is a set ($) and vice-versa. 

The sum of a finite number or of an infinity of open sets, as 
well as the common part of a finite number of such sets, is always 
an open set. Any common part of a finite number, or of an infinity, 
of closed sets, and also any sum of a finite number of such sets, 



Metrical space. 


41 


l§2] 


are closed sets. Nevertheless, the sets (fto) and (($>*) (cf. Chap. I, § 2, 
p. 5) do not in general coincide with the sets (5) and (©), although 
every set (5) is clearly a set (5„) and, at the same time, a set (©«>); 
for, if F is a closed set and 6„ denotes the set of the points x such 
that q(x, F)<l/n, we have F=nG„, where G„ are open. The cor- 

n 

responding result for the sets (©) is obtained by passing to the 
complementary sets. Moreover, it follows that any set expressible 
as the common part of a set (5) and a set (©) is both a set (%„) and 
a set (<&<j). 

We shall denote by 95, the smallest additive class that includes 
all closed sets (cf. Chap. I, Th. 4.2). This class, clearly, includes also 
all sets (®«i) and ($„). The sets (93) are also termed measurable (13) 
(in accordance with Chap. I, § 4, p. 7). They are known as Borel sets. 

We shall also give a few “relative” definitions having re- 
ference to a set M. The common part of M with any closed set 
is dosed in M ; we see at once that, for a set PQ M to be closed 
in M, it is necessary and sufficient that P=MP , i. e. that the 
set P contains all its points of accumulation belonging to M. Sim- 
ilarly, any set expressible as the common part of M and an open 
set is termed open in M. 

Any set of the form ilf-S(a; r), where aeM and r>0, is called 
portion of M. If every portion of M contains points of a set A, 
i. e. if M, the set A is said to be everywhere dense in M. If a set B 
is not everywhere dense in any portion of M , i. e. if no portion 
of M is contained in B, the set B is said to be non -dense in M. In 
other words, a set B is non-dense in M, if, and only if, each portion 
of M contains a portion in which there are no points of B. It fol- 
lows at once that the sum of a finite number of sets non-dense in 
the set M is itself non-dense in M. The sets expressible as sums 
of a finite or enumerably infinite number of sets non-dense in M 
are termed (according to R. Kaire [1]) sets of the first category in M, 
and the sets not so expressible are termed sets of the second category 
in M. In all these terms, the expression “in M” is omitted when 
M coincides with the whole space; thus, [by “non-dense sets”, 
we mean sets whose closures contain no sphere and by “sets of 
the first category”, enumerable sums of such sets. 

A set M is called separable, if it contains an enumerable 
subset everywhere dense in M. 
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§3. Continuous and semi-continuous functions. If /(*) 
is a function of a point, defined on a set A containing the point a, 
we shall denote by M^(/; a; r) and m a; r ), respectively, the 
upper and lower bounds of the values assumed by f(x) on the por- 
tion -4-S(a; r) of the set A. When r tends to 0, these two bounds 
converge monotonely towards two limits (finite or infinite) which 
we shall call respectively maximum and minimum of the function 
f(x ) on the set A at the point a, and denote by M^(/; a) and m A (fi a). 
Their difference o^(/; a) = M^(/;a) — m^(/; a) will be called oscil- 
lation of f(x) on A at a. We clearly have 

(3.1) m^(/; a) ^ f(a) ^ M. 4 (/; a) for every point aeA. 

If f(a) = n\ A (f-, a), the function f(x) is said to be lower semi- 
continuous on the set A at the point a; similarly, if f(a) — M A (f; a), 
the function f(x) is tipper semi-continuous on A at a. If both con- 
ditions hold together, and if f(x) is finite at the point o, i. e. if 
m .<(/; a)= M A (f-, a ) * oo, the function f(x) is termed continuous on A 
at the point a. Functions having the appropriate property at all 
points of the set A, will be termed simply lower semi-continuous, 
or upper semi-continuous, or continuous, on A. In all these terms 
and symbols, we usually omit all reference to A, when the latter 
is an open set (in particular, the whole space), or when A is kept 
fixed, in which case the omission causes no ambiguity. 

From these definitions we conclude at once that, if f(x) is 
upper semi-continuous, the function — f(x) is lower semi-continuous, 
and vice-versa; and further, that, if two functions are upper (or 
lower) semi-continuous, so is their sum (supposing, of course, that 
the functions to be added do not assume at any point infinite values 
of opposite signs). 

(3.2) Theorem. For every function f(x ) defined on a set A, the 
set of the points of A at which f(x) is not continuous on A, is the com- 
mon part of the set A with a set (*$„). 

Proof. Let us denote by F„ the set of the points x of A at 
which either f(x)—±oo, or o^/;#)^ l/». The set F = -2 F„ con- 

n 

sists of all the points of A at which the function f(x) is not con- 
tinuous. Now it is easy to see that each of the sets F„ is closed in A, 
i. e. that F n —A-F n . Therefore F is the common part of A and the 
set 2F„, which is a set ($„)• 
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(3.3) Theorem. For a function of a point f(x) to be upper [lower] 
semi-continuous on a set A, it is necessary and sufficient that , for 
each number a, the set 

(3.4) E [x <- A ; f(x) ^ a] [E [x e A ; /(a?)< a]] 

X X 

be closed in A, i. e. expressible as the common part of A with a set (5). 

Proof. We need only consider the case of upper semi-contin- 
uous functions, as the other case follows by change of sign. 

Let f(x) be a function upper semi -continuous on A, a an ar- 
bitrary number, and x 0 eA a point of accumulation of the set (3.4). 
For each r>0, the sphere S(a; 0 ; r) then contains points of that set, 
and this requires M A (f; x 0 ; r)^a and so M A (f; x 0 )^a. Since by 
hypothesis M A (f; x 0 ) = f(x 0 ), we derive f(x 0 ) ^ a, so that x 0 belongs 
to the set (3.4). This set is thus closed in A. 

Suppose, conversely, that the set (3.4) is closed in A for each a. 
Since the relation M A (f;x) = f(x) is evident for any x at which 

f(x) = + oo, let x 0 be a point at which f(x 0 ) < + oo, and a any 

number greater than f(x 0 ). The set (3.4) is closed in A and does not 
contain x 0 , and so, for a sufficiently small value r n of r, contains 
no point of the sphere S (x 0 -,r). Thus M*(/; x 0 )^M A (f; x 0 -, r„)^.a for 
every number «>/(#„), and hence M^(/; x 0 )^.f(x 0 ), which, by (3.1), 
requires M. A (f; x 0 ) = f(x 0 ). 

An immediate consequence of Theorem 3.3 (cf. Chap. I, § 7, 
particularly p. 13) is the following 

(3.5) Theorem. Every function semi-continuous on a set (©) is 

measurable ( < B) on this set. More generally, if 9E is any additive 

class of sets including all closed sets (and so all sets measurable (©)), 
every function semi-continuous on a set (9 E) is measurable (9 E) on 
this set. 


8 4. Carath6odory measure. A function of a set F(X), 
defined and non-negative for all sets of the space M, will be called 
outer measure in the sense of Caralhfodory, if it fulfills the fol- 
lowing conditions: 

(C,) r(X)<^T(Y) whenever AC^, 

(C s ) T(EX.)^2 r(X,) for each sequence {AT,} of sets, 

(C,) P( X+ Y) = I'(X) + F( Y) whenever q(X, Y) > 0. 
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It should be noted that of these three conditions, the last one, only, has 
a metrical character. Now in this §, as well as in the §§ 5 and 6, we shall use 
only properties (Ci) and (C,) of the Carath4odory measure. Hence all the re- 
sults of these §§ remain valid in a perfectly arbitrary abstract space. 

In order to simplify the wording, we shall suppose, in the 
rest of this chapter, except in § 8 which is concerned with certain 
special measures, that an outer Carath^odory measure T(X) is 
u; oiquely determined in the space considered. 

i A set E will be termed measurable with respect to the given 
outer measure T(X), if the relation P(P+Q) = r(P)+r(Q) holds 
for every paJ'V'd sets P and Q contained, respectively, in the set E 
and in its complement C E what amounts to the nai^e, if 

E{X)—r(X-E)+r(X-GE) holds for every set X. By condition (C 2 ) 
this last relation may be replaced by the inequality r(X)^? 

^r(x-E) + r(x-GE). 

The class of all the sets that are measurable with respect to /', 
will be denoted by fir. We see at once that this class includes all 
the sets X for which E(X)~0 (in particular, it includes the empty 
set). Moreover it is clear that complements of sets (fir) are also 
sets (fir). 

The main object of this § is to establish the additivity of 
the class fir (in the sense of Chap. I, § 4) and to prove that the 
function J'(X) is a measure (fir) in the sense of Chap. I, § 9. 
This result will constitute Theorems 4.1 and 4.5. 

(4.1) Theorem. If 8 is the sum of a sequence {AT„};i=i, 2 , ... of sets (fir) 
no two of which have common points, the set 8 is again a set (fir) and 
r(S) = 2 r(X n y, more generally, for each set Q 

n 

(4.2) r(Q ) = 2 r(Q -X„) -f r(Q . C S). 

n 

k 

Proof. Let 8/,= 2 X„. We begin by proving inductively that 

/I— 1 

all the sets *5* are measurable with respect to V, and that, for each k 
and for every set Q, 

(4.3) r(Q)^2r(Q.X n ) + r(Q.C8„). 

/I— I 

Suppose indeed, that 8 P is a set (fir) and that the inequality (4.3) 
holds for every set Q, when k = p. Since X p+1 is, by hypothesis, 
a set (fir) and S p -X p+ i — 0, we then have 



CaratModory measure. 


45 


[§ 4 ] 


r(Q) = r(Q.x p+i ) + r(Q.cx p+] ) = 

= r(Q-x p+i ) + r(Q. c x p+r s p ) + r(Q.cx p+r cs p ) = 

p~\~\ 

= AQ-Zp+O + P(Q-8 p ) + ir{Q-X n ) +r(Q.CS p+i ) 

n 1 

and this is (4.3) for k — p 1. In view of condition (C 2 ), p. 43, 
it follows further that r(Q)^ l'(Q ■ S p ^i) + r(Q-CS P r\), which 
proves that 8 P+ 1 is a set {Hr). 

Combining the inequality (4.3), thus established, with the in- 
equality r(Q-CSi,) ^ r(Q-C8), we obtain, by making k->oo, the 

inequality P (Q) ^ £ r(Q-X„) -f r(Q-CS), and from this (4.2) 

n~ 1 

follows on account of condition (C 2 ). 

Finally, the same condition enables us to derive from (4.2) 
that F(Q)^ T(Q- S)+r(Q-C8), and this shows that 8 is a set (Hr) 
and completes the proof. 

(4.4) Lemma. The difference of two sets (Hr) is itself a set (Hr). 

Proof. Let XeH r and Y eHr, and let P and Q be any two 
sets such that PCX — Y and #CC(Y — Y). Write Q l — Q Y 
and Q 2 = Q CY. Making successive use of the three pairs of in- 
clusions &CF, Q 2 CCF; PCX, Q 2 CC(X-Y)-CYCCX-, and 
QxCX, P+QtCCY, we find r(P)+r(Q)=/’(P)+7’(Q 1 )+r(^ t )= 
= r(P-\-Q 2 )-{-r(Q 1 )=r(P-\-Q) ) which shows that X — Y is a set (8/ ). 

(4.5) Theorem. Hr is an additive class of sets in the space M. 

Proof. We have already remarked (p. 44) that the empty 
set and that complements of sets (Hr) are sets (Hr). To verify the 
third condition (iii) for additivity (cf. Chap. I, § 4, p. 7), let us 
observe firstly that, on account of Lemma 4.4 and of the identity 
X Y — X — C Y, the common part of any two sets (Hr) is itself 
a set (Hr). This result extends by induction to common parts of 
any finite number of sets (Hr) and, with the help of the identity 
2 Xi= C II CXi, we pass to the similar result for finite sums of 
sets. Finally, if X is the sum of an infinite sequence {X„)„ . 1 . 2 ,... 
of sets (8r), we have X=8 1 +5f (S„+i — 8„) where S„=£ X*. Now, 

/I— I k— 1 

clearly, of the sets S, and <SVh — S„, no two have common 
points, and, moreover, by the results already proved, they all 
belong to the class Hr. Consequently, to ascertain that X is a set (Hr), 
we have only to apply Theorem 4.1. The class Sr is thus additive. 
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Theorem 4.6 connects the considerations of this chapter with 
those of the preceding one. Thus, in accordance with the conven- 
tions adopted in Chap. I, pp. 7 and 16, the sets (fir) may be termed sets 
measurable (fir), and -T(X) may, for Xefir, be regarded as a 
measure associated with the class fir. This class, together with 
the measure -T, determines further the notions of functions 
measurable (fir), of integral (fir, I 7 ), of additive function 
of a set (fir) absolutely continuous (S r , I 7 ), and the other 
notions defined generally in Chap. I. Since the outer measure J* 
determines already the class fir, we shall omit in the sequel the 
symbol representing this class, whenever the notation makes expli- 
cit reference to the outer measure; thus we shall say “function 
integrable (/’)” instead of “function integrable (fir,/ 7 )” and the 
integral (I 7 ) of a function f(x) over a set E will be denoted simply 

by J f(x)d,r(x), instead of by ( 2 r) f f(x)dr(x). 

K E 

In accordance with Chap. I, § 9, the value taken by E(X) for 
a set X measurable (fir) will be termed measure (I 7 ) of X ; when 
X is quite arbitrary, this value will be called its outer measure (V). 

If E 0 is a subset of a set E such that E(E — E 0 ) = 0 , then 
for any function f(x) on E the measurability (fir) of / on E is 
equivalent to its measurability (fir) on E 0 . This remark and 
Theorem 11.8, Chap. I, justify the following convention: 

If a function f(x) is defined only almost everywhere (T 7 ) 
on a set E, then, E 0 denoting the set of the points of E at which 
f(x) is defined, by measurability (fir), integrability ( E) and integral (/’) 
of f on the set E we shall mean those on the set E 0 . 

Let us note two further theorems. 

(4.6) Theorem. Given an arbitrary set E, (i) r(E-2 X„)~ SE(E-X„) 

n n 

for every sequence {X„} of sets measurable (fir) no two of which have 
common points, (ii) r(E- limX„) = lim r(E-X„) for every ascending 

n n 

sequence {X,,} of sets measurable (fir), and this relation remains valid 
for descending sequences provided , however , that r(E-X x ) =^oo, 
(iii) more generally , for every sequence {X„} of sets measurable (fi r) 
r (E limint X„)^liminl r(E X n ), and , if further r(E-2 X n ) 4" °°> 

n n a 

then also E(E- lim sup X„) > lim sup r(E-X„). 
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Part (i) of this theorem is contained in Theorem 4.1, and 
parts (ii) and (iii) follow easily from (i) (cf. Chap. I, the proofs of 
Theorems 5.1 and 9.1). 

A part of Theorem 4.6 will be slightly further generalized. Given 
a set E, let «s denote, for any set A, by F'f (X) the lower bound of 
the values taken by F(E-Y) for the sets Y measurable (2/) that 
contain X. 

(4.7) Theorem. Given a set E, (i) to every set X corresponds 
a set X° 3 X measurable (2 r) such that Ff. (X) — 7’(2?-Z 0 ), 
(ii) T(E ■ lim inf X„) ^ Z™ (lim inf X„) ^ lim inf r% (X„) for every se- 

II II ll 

quence {X n ) of sets, and, in particular, T(E • lim X n ) ^ /^(lim X,,)— 

n n 

— lim r° E (X n ) for every ascending sequence {X,,}. 

n 

Proof, re (i). For every positive integer n there is a set r„3) A, 
measurable (2 /•), such that r(E • Y„) ^ Ff (X) + l/n. Writing 
A' 0 = TJ Y„, we verify at once that the set A 0 has the required 

tl 

properties. 

re (ii). Taking (i) into account, let us associate with each set X n 
a set X nZ)X n , measurable (2 r) and such that r(E-Xl) = rl.(X„). 
The set lim inf 3) lim inf X„ is measurable (2 r) and, we therefore 

n n 

have, by Theorem 4.6 (iii) 

r£(liminf A„)< f(E • lim inf A,“) ^ lim inf 7 1 (EA“)=liminf rj?.(X n ). 

ii n n n 

The second part of (ii) follows at once from the first part. 

* § 5. The operation (A). We shall establish here that 
measurability (2; ) is an invariant of a more general operation 
than those of addition and multiplication of sets. 

We call determining system, any class of sets ')l={A nil n ., ■ in 
which with each finite sequence of positive integers %, w 2 , ...,n* there 
is associated a set A,,, ,,, „ h . The set 

(5.1) 2j 

n.."; »/(••■• 

where the summation extends over all infinite sequences of indices 
»!, n 2 , ..., »*, ..., is called nucleus of the determining system *21 and 
denoted by N(2l). The operation leading from a determining system 
to its nucleus is often called the operation (A). 
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The operation (A) was first defined by M. Souslin [1] in 1917. When 
applied to Borel sets, it leads to a wide class of sets (following N. Lusin, we call 
them analytic) and these play an important part in the theory of sets, in the 
theory of real functions, and even in some problems of classical type. A system- 
atic account of the theory of these sets will be found in the treatises of H. Hahn 
[II], F. Hausdorff [II], C. Kuratowski [I], N. Lusin [II] and W. Sierpin- 
ski [II]. 

Wd mentioned at the beginning of this } that the operation (A) includes those 
of addition and of multiplication of sets. This remark must be understood as follows: 
If SOI is a class of sets such that the nucleus of every determining system formed of sets (9J1) 
itself belongs to $01, then the sum and the common part of every sequence {Ni} of sets 
($01) are also sets^Ul). In fact, writing P n ^n 2 , N ni and Qn it n 2t . .,n k =Nk we 

see at once that the nuclei of the determining systems {Pn v n 2 ,...,n k } and 
{Qn v n 2 ,. .,n h } coincide respectively with the sum and with the common part of 
the sequence {Ni}. Thus, Theorem 5.5, now to be proved, will complete the result 
contained in Theorem 4.5, and in conjunction with Theorem 7.4, establish mea- 
surability (B/0 for analytic sets in any metrical space (cf. N. Lusin and 
W. Sierpinski [1], N. Lusin [3, pp. 25 — 26], and W. Sierpinski [12; 15]). 
The proof of this can be simplified if we assume regularity of the outer meas- 
ure P (cf. § 6) (see C. Kuratowski [I, p. 58]). 

With every determining system *31 = {A„ vn2t ..., nk }, we shall also 
associate the following sets. u a i»* 2 » ($1) will denote, for each 

finite sequence h 19 A 2 , ..., h s of positive integers, the sum (5.1) 
extended over all sequences, n v n 2J ..., n*, ... such that for 

We see at once that the sequence {N* (51 )}a = i, 2 ,..., 

together with every sequence is monotone 

ascending and that 

(5.2) N(<H) = limJSrA(9I), ^(91) = ^ •'**'’(21). 

h h 

Further, for every sequence of positive integers h v h 21 h k , ..., we 
shall write 

(*21) = ^ n if As(^0 ~ /i*, • • * y 

n j /i| n,3-h, 

h'2, •••» h/fi ng, njjj • * • * 

n 2S*2 "k= h k 

We see directly that if the sets of the determining system *31 

belong to a class of sets 901, the sets N Al> /, 2 belong to the 

class 901« 0 . 
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(5.3) Lemma . For every determining system m={A„ t ,„ 2 and 

for every sequence of positive integers h u h ir ..., h *, ... 

(5.4) N Al (9l) • ^ hvh2 m..^ hvh2y ..., hk m • ...CN(«), 

Proof. Let x be any point belonging to the left-hand side 
of (5.4). We shall show firstly that a positive integer can 

be chosen so that, for each Tc^ 2, the point x belongs to a set 

A„^A nv n 2 *...'A„ l9 „ 2 nk for which n x =n° x and for i= 2, 3, ..., k. 

Indeed, if there were no such integer n° x1 we could associate with 
each index n^h lr a positive integer k n , such that x belongs to no 
product A„-A n , n2 >...-A„ t „ 2t ... |Jljk for which n { ^hi when i=2, 3, ..., k n . 
Denote by p x the greatest of the numbers k 21 ..., k hi . The point x 
thus belongs to none of the sets A ni -A n{tn2 -...-A nit „ 2% , for which 
when i = 1, 2, ..., p 19 and therefore is not contained in 
their sum (^I). This is a contradiction since, by hypo- 

thesis, x is an element of the left-hand side of (5.4). 

After the index n° x , we can determine afresh an index 
so that, for each k^ 3, the point x belongs to asetA /I1 -A. /I1 , /l2 *...-A /ll ,„ 2>> 
for which n x = n° x , w 2 = an( l Ui ^ w ^en i = 3, 4, ..., fc. For, if 
there were no such index, we could find, as previously, a positive 

integer P2>3 such that x belongs to no product A n ^A nitn2 -..^A nv „ 2 „ 

for which n x =n\ and when i=2, 3, ..., p 2 . And this would 

contradict the definition of the index n° x . 

Proceeding in this way, we determine an infinite sequence of 
indices {n° ( } such that when i = 1 , 2, ... and such that 

xe A n o.A n o >n o.....A w o n o > . ^ >w o.. # . . Thus #eN(2J), and this completes 

the proof. 

Lemma 5.3 is due to W. Sierpinski [13]. The proof contains a slightly 
more precise result than is expressed by the relation (5.4) and shows that the 
left-hand side of that relation coincides with the sum (5.1), when the latter is 
extended only to systems of indices n lt n 2 , ..., nk , ... restricted to satisfy 

n i ^ n 2 ^ ^2* **•> n k ^ ^ k* *•* * 

Let us call degenerate, a determining system {A n ^n 2 , ...,/»*} such that, for 
some sequence {hk} of positive integers, we have An x ,n 2 , ...,n k — 0 whenever nk hk . 
Then, for this sequence {hk), the relation of inclusion (5.4) becomes an identity 
and we are led to the foUowing theorem: 

If a degenerate determining system consists of sets belonging to a class SR, 
its nucleus is a set (SR<5a<q. A similar theorem cannot hold for non -degenerate 
systems: in fact, as shown by M. Sou si in, the operation (A) applied to Borel 
sets (and even to linear segments) may lead to sets that are not Borel sets (cf. 
F. Hausdorff [II, p. 182—184]). 
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(5.5) Theorem. The nucleus of any determining system l 3l={^4n 1 ,n 2 ,...,/i A } 
consisting of sets measurable (2 1 ) is itself measurable (2 r). 

Proof. Let us write, for short, 

N = N(9l), N" i"2 "* = N n i' n 2 "*(51), N„ v „ 2 „ k = N n]i „ 2 „ A (2I). 

We have to show that, for any set E 

(5.6) P(E)^ T(E.N)+ P(E-CN). 

We may assume that r(E) < 00 , since (5.6) is evidently fulfilled 
in the opposite case. 

Let us denote (as in § 4, p. 47) by /^(JT) the lower bound 
of the values of P(E-Y) for sets YZ)X measurable (2 r) and let e 
be an arbitrary positive number. Taking into account (5.2) and 
Theorem 4.7, we readily define by induction a sequence of positive 
integers { hi, } such that r a E (N h ')^ P(E-N ) — f/2 and 

P£(N h i a 2 h k)~^ ri(N h \<H h k- 1 ) — c/2* for k = 2, 3, ... . 

Thus the sets N„ v „ 2< • ,n*D-^ r " 1 ’" 2 ’ ' "’"* being measurable (2/ ) together 
with the Ani, 11 %, nfcf 

r(E- N hl , h2 „ k ) > *2 **) ^ I'(E-N)—e 

for each k, and therefore 

r(E) = r{E.N hl , ht '..., hk) +r(E-CN hv „ 2 hk )^ 

( ^ r(E.N) + r(E-GN hi , h2 „. <hk ) — c. 

Now the sequence of sets {Ni, t ,h 2 h k }t,^ 1 , 2 , .. is descending, and 

byLemma5.3its limit is a subset of N. The sequence (0 
is thus ascending and its limit contains the set C N. Hence, making 
fc— ► 00 in (5.7), we find, by Theorem 4.6(ii), the inequality 
r(E)^r(E-N)+r(E-CN) — e, and this implies (5.6) since £ is an 
arbitrary positive number. 

§ 6. Regular sets. A set X will be called regular (with respect 
to the outer measure JP), if there exists a set A measurable (2/-), 
containing X and such that P(A) = P(X). Every measurable set 
is evidently regular, and so is also every set X whose outer measure ( P ) 
is infinite, since we then have r(X)—P(M)—oo. If every set 
of the space considered is regular with respect to the outer measure r , 
this measure is itself called regular; cf. H. Hahn [I, p. 432], 
C. Carath6odory [1; I, p. 258]. 



Borel sets. 


51 


[§71 


Denoting by r°(X) the lower bound of the values of -F( Y) 
for sets IJY measurable (2 r), we see readily that the relation 
r°(X) = r(X) expresses a necessary and sufficient condition for 
the set X to be regular. From Theorem 4.7(ii), taking for the 
set E the whole space, we derive the following: 

(6.1) Theorem. For- any sequence {Z n ) of regular sets /'(liminf X„)^ 

n 

^ liminf -F(X„), and, if further the sequence (X„) is ascending 
r(\imX„) = \imr(X n ). 

n n 


The generality and the importance of this theorem consist in that all 
outer measures F that occur in applications satisfy the condition of regularity, 
and, for these measures, the last relation of Theorem 6.1 therefore holds for 
every, ascending sequence of sets. Nevertheless, for measures that are not them- 
selves regular, the restriction concerning regularity of the sets X n is essential 
for the validity of Theorem 6.1 as is shown by an example of irregular measure 
due to C. Carath^odory [II, pp. 693 — 696]. 

We may observe further that, for any fixed set E, the function of a set / e( X), 
defined in § 4, p. 47, is always a regular outer measure, even if the given measure 
r(X) is not. Conditions (C x ) and (C 2 ) together with that of regularity, are at once 
seen to hold, and (C 8 ) may be derived from Theorem 7.4, according to which 
closed sets are measurable (C/ 1 )* 

§ 7. Borel sets. We shall show in this § that, independently 
of the choice of the outer measure l 1 , the class 2/’ contains all 
Borel sets. 

(7.1) Lemma . If Q is any set contained in an open set 0 , and Q„ 
denotes the set of the points a of Q for which p(a, CO)^l/n, then 
lim F(Q n ) = r(Q). 

n 

Proof. Since the sequence {#„} is ascending and Q = lim Q,„ 

n 

it suffices to show that lim r(Q n )^J'(Q). For this purpose let us 

n 

write D n —Q„+i — Q n . We then have Q(D n+l , Q„)^l/n(n+l) > 0, 
provided that D„+ 1=}=0 and Q n =J= 0. Hence, taking into account 
condition (C 3 ), p. 43, it is readily verified by induction that 

( 7 . 2 ) r{Q u+l )^r(Zb ik )=Zr(D ik ),r(Q in )^i\XDu-^Zr(D lk ,) 

k^\ k^\ k^l h-\ 

oo 

for every positive integer n. Writing, for short, a„=£ r{D 2 k ) and 

kn 
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OO 

b„ — £ r(D ik -i ), we obtain at once, by condition (C 2 ), p. 43, 

k— m +1 

(7.3) r(Q) ^ r(Qi n ) + a n + Sh- 

oo 

Now two possibilities arise: either both series, £ r(D ik ) and 

oo 

v l\D ik - { ) have finite sums, or, one at least has its sum infinite. 
& 

In the former case and b„-> 0, so that, the required inequal- 
ity I 1 (Q) r(Q„) follows by making n-+oo in (7.3); while, in 

n 

the latter case, the inequality is obvious, since by (7.2) we have 
then lim r(Q„) — oo. 

/l 

(7.4) Theorem. Every set measurable (93) is measurable (2 r ). 

Proof. Since the class 2/’ is additive and since 93 is the smallest 
additive class including the closed sets (cf. § 2, p. 41), it is enough to 
prove that every closed set is measurable (2/), i. e., denoting any 
such a set by F, that 

(7.5) r(P+Q)^r(P) + F(Q) 

holds for every pair of sets P(ZF and QC_CF. Since the set CF 
is open, there is, by Lemma 7.1, a sequence {Q„} of sets such that 
QnCQ, Q(Qn, F) > 1 In for »*=1, 2, ..., and lim r(Q„) = r(Q). Thus 

/I 

q(Q„, P) ^ Q(Q n , F) > 0, and so, on account of condition (C 3 ), p. 43, 
we derive P(P + Q)^ P(P + Q„) = T(P) -f- r(Q„) for each n, and, 
making n->o o, we obtain (7.5). 

The arguments of this § depend essentially on property (C 3 ) of outer meas- 
ure, and on the metrical character of the space 3/, which did not enter into 
§§ 4 — 6. It is possible however to give to these arguments a form, independent 
of condition (C 3 ), valid for certain topological spaces that are not necessarily 
metrical (cf. N. Bourbaki [1]). 

From the preceding theorem coupled with Theorem 3.5, we 
derive at once the following 

(7.6) Theorem . (i) Every function measurable (93) on a set E 
is measurable (2r) on E. (ii) Every function that is semi-continuous 
on a set (2r) is yneasurable (2/ ) on this set. 
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§ 8. Length of a set. We shall define in this § a class 
of functions of a set that are outer Carath^odory measures and 
that play an important part in a number of applications. 

Let a be an arbitrary positive number. Given a set X, we 
shall denote, for each e > 0, by Aa\X) the lower bound of the 

oo 

sums 2[t{Xj)] tt , for which {X,}/= is an arbitrary partition of X 

i=i 

into a sequence of sets that have diameters less than f. When 
f— >0, the number ^«*(X) tends, in a monotone non-decreasing 
manner, to a unique limit (finite or infinite) which we shall denote 
by .d a (X). The function of a set A a (X) thus defined is an outer 
measure in the sense of Carath^odory. For, when f>0, we clearly 
have (i) A^X^A^Y) if XCY, (ii) A^(S^n)^lA ( :\x n ), if{X„) 

n n 

is any sequence of sets, and (iii) d.L f> (X-f Y) = di'^X) -f dl'^Y), 
if q\X, Y) > f. Making f -> 0, (i), (ii), (iii) become respectively 
the three conditions (Ci), (C 2 ), (C 3 ), p. 43, of Carat h^odory for A„(X). 

We shall prove further that the outer measure A a (for any o>0) 
is regular in the sense of § 6, i. e. that every set is regular with 
respect to this measure. We shall even establish a more precise 
result, namely 

(8.1) Theorem. For each set X there is a set H t($>t such that 
XCH and A„(H) = A a (X). 

Proof. For each positive integer n, there is a partition of X 
into a sequence of sets (Xl"*},- 1,2, .. such that 

00 

(8.2) d(X(" ) <1/2 m for i= 1 , 2 ,..., and Y[<5(Xj" , )]“ < A U (X) + 1/n. 

/ - 1 

We can evidently enclose each set X/ n) in an open set G\ n) such that 

(8.3) (5(G<" , )<( l + l/n)<5(X‘ n) ). 

00 °° 

Writing H — // JF G, \ we see at once that H is a set (©,>) and that 

/i 1 1 1 

X^H. Moreover, for each n, H — H and the relations (8.2) 

/ 1 

and (8.3) imply that d (H-G\ n} )<il/n for *=1, 2, ... and that 
A ( J "\H)^Z [ 6(H • <?! n) )]« < (1 +1 /n)" [ A„(X)+l/«]. Making n -> oo, 

we find in the limit A, ( (H)*^L A„(X), and, since the converse in- 
equality is obvious, this completes the proof. 
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In Euclidean M-dimensional space R n (see Chap. Ill), the sets 
whose measure (A„) is zero may be identified with those of measure 
zero in the Lebesgue sense. By analogy, in any metrical space, 
sets whose measure ( A a ) is zero are termed sets having a-dimensional 
volume zero, and in particular, when a = 1, 2, 3, sets of zero length, 
of zero area, of zero volume, respectively. For the same reason, sets 
of finite measure (A, t ) are termed sets of finite a-dimensional volume 
(or of finite length, finite area, finite volume, in the cases «— 1, 2, 3). 
In particular, in Jt lf i. e. on the straight line, the outer measure A t 
coincides with the Lebesgue measure, and, on this account, we call 
the number A 1 (Z), in general, outer length of X, and when X is 
a set measurable (2 1 ,), simply, length of X. For short, we often write A 
instead of A v 

We have mentioned only the more elementary properties of the measures 
A a , those, namely, that we shall have some further occasion to use. For a deeper 
study, the reader should consult F. Hausdorff [1]. Among the researches de- 
voted to the notion of length of sets in Euclidean spaces, special mention must 
be made of the important memoir of A. S. Besicovitch [1]; cf. ako W. Sier- 
pinski [11 and J. Uillis [1], 

§ 9. Complete space. A metrical space is termed complete, 
if a sequence {a„} of its points converges whenever lim Q(a m , a,,)— 0. 

In any metrical space, this is evidently a necessary condition for 
convergence of the sequence {a n ), but, as a rule, not a sufficient 
one. The following theorem concerns a characteristic property of 
complete spaces: 

(9.1) Theorem . In a complete space , when {F n \ is a descending 
sequence of closed and non-empty sets whose diameters tend to zero, the 
common part II F n is not empty . 

n 

Proof. Let a„ be an arbitrarily chosen point of F„. For n^m, 
we have (*(<*»,, a„)^.d(F m ), and hence lim Q(a m ,a,,)= 0. The sequence 

m,n~X>o 

{a„) is thus convergent. Now the limit point of this sequence clearly 
belongs to all the sets F„„ since a n eF„(ZF„, whenever n~^m, and 
since the sets F m are closed by hypothesis. 

(9.2) Baire’8 theorem. In a complete space M, every non-empty 
set ((&<) is of the second category on itself, i. e. if H is a set (©<) 
in M and H=S H„, one at least of the sets H„ is everywhere dense 

n • 

in a portion of H. 
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Proof. Suppose, accordingly, that H = [J @n where G„ are 

n~ 1 

open sets, and further that 

(9.3) H = ZH n 

n—l 

where H„ are non-dense in H. The partial sums of the series 

(9.3) are then also non-dense in H (cf. § 2, p. 41), and it is easy to 
define inductively a descending sequence of portions S„ of H 

n 

such that (i) S„(ZG„, (ii) S„-^Hj= 0, (iii) <5(S„)^l/n. On account 

of Theorem 9.1 and of (iii), the sets S„ have a common point, which 
by (i), belongs to all the sets G n , and so to H, while at the same 
time, by (ii) it belongs to none of the H„. This contradicts (9.3) 
and proves the theorem. 

The case of Theorem 9.2 that occurs most frequently, is that in which H 
is a closed set. For closed sets in Euclidean spaces the theorem was established 
in 1899 by R. Baire [1]. To Baire, we owe also the fundamental applications 
of the theorem, which have brought out the fruitfulness and the importance 
of the result for modern real function theory. As regards the theorem by itself 
however, it was found, almost at the same time and independently by W. F. 
Osgood [1] in connection with some problems concerning functions of a complex 
variable (cf. in this connection, the interesting article by W. H. Young [7]). 
The general form of Theorem 9.2 is due to F. Hausdorff [I, pp. 326 — 328; II, 
pp. 138—146]. 

If a is a non-isolated point (cf. § 2, p. 40) of a set M, the set (a) 
consisting of the single point a is clearly non-dense in M. It there- 
fore follows from Theorem 9.2 that 

(9.4) Theorem . In a complete metrical space, every non-empty 
set (©< >) without isolated points, and in particular every perfect set, 
is non-enumerable. 

More precisely, by a theorem of W. H. Young [l ], every set that fulfills 
the condition of Theorem 9.4 has the power of the continuum; cf. also F. Haus- 
dorff [II, p. 136]. 
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Functions of bounded variation and the 
Lebesgue-Stieltjes integral. 

§ 1. Euclidean spaces. In this chapter, the notions of 
measure that we consider undergo a further specialization. Accord- 
ingly we introduce for Euclidean spaces, a particular class of 
outer measures of Carath^odory, determined in a natural way by 
non-negative additive functions of an interval. These outer meas- 
ures in their turn determine the corresponding classes of meas- 
urable sets and measurable functions, and lead to processes of 
integration usually known as those of Lebesgue-Stieltjes. 

By Euclidean space <>j m dimensions li m , we mean the set of 
all systems of m real numbers (x i} x 2 , ..., x m ). The number Xu is 
termed fc-th coordinate of the point (X|, x 2 , ..., x m ). The point (0, 0 , ..., 0) 
will be denoted by 0. 

By distance g(x,y) of two points x — (x h x,, ..., and 
ym(y h //„,) in the space li m , we mean the non-negative number 
L(,'/i — j'i f + iy-i — x 2 f f- ... f (y„, — \ Distance, thus defined, 
evidently fulfills the three conditions of Chap. II, p. 40, and hence 
Euclidean spaces may be regarded as metrical spaces. All the defi- 
nitions adopted in (’hap. II therefore apply in particular to spaces 
R m . In § 2 we supplement them by some definitions more exclusively 
restricted to Euclidean spaces. 

The space /?, is also termed straight line and the space /? 2 , 
plane. Accordingly, the sets in Jt t will often be called linear, and 
those in /? 2 plane sets. 
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$ 2. Intervals and figures. Suppose given a Euclidean 
space R m . 

The set of the points (a?!, x 2 , ..., x m ) of R m that fulfill a linear 
equation aiXi+a^Xi-j- ...+a m x m =b, where b, a t , a 2 , ..., a.„ are real 
numbers and, of these, the coefficients a t , a 2 , ..., a m do not all vanish 
together, is called hyperplane a\Xi-\-aiXi-\- ...+a m x m =b. For each 
fixed &= 1 , 2 , ..., m, the hyperplanes x k =b are said to be orthogonal 
to the axis of x k . The term hyperplane by itself, will be applied ex- 
clusively to a hyperplane orthogonal to one of the axes. In R v 
hyperplanes coincide with points. In R 2 and R 3 they are respectively 
straight lines and planes. 

Given two points a=(ai, a 2 , ..., a m ) and b--(b\, b>, ..., b m ) 
such that a k ^.b k for As= A, 2 , ... , m, we term dosed interval 
[«i, fti ; ai,bi \ ...; a m ,bm\ the set of all the points (x t , x 2 , ..., x m ) such 
that for fc=l,2, ..., m. The points a and b are called 

principal vertices of this interval. If, in the definition of closed in- 
terval, we replace successively the inequality a* ^ x k ^ b k by the in- 
equalities ( 1 °) «*<»*< b k , ( 2 °) a k ^x k <b k and (3 °)a k <x k ^b k , we 
obtain the definitions ( 1 °) of open interval (a t , b\) a 2 , b 2 ; ...; a„„ b m ), 
( 2 °) of interval half open to the right [a h b,; a 2 , b 2 ; ...; a m , b,„) and ( 3 °) of 
interval half open to the left (ai, 6 t ; a>, b 2 \ ...; a m , & m ]. If a k =b k for 
at least one index k, all these intervals are said to be degenerate. 
In what follows, an interval , by itself, always means either a closed 
non-degenerate interval or an empty set, unless another meaning 
is obvious from the context. 

We call face of the closed interval I=[<i h 6 ,; a?, b 2 \ ...; a„„ 6 ,„] 
the common part of I and any one of the 2m hyperplanes x k —a k 
and x k =b k , where k= 1 , 2 , ..., m. If J is an_open or half open in- 
terval we call faces of J those of its closure J. We see at once that 
the faces of any non-empty interval I are degenerate intervals and 
that their sum is the boundary of the interval I. 

If b\— ai^bi— a 2 =...= 6 m — a m ' 0 , the interval [a,,&i; « 2 , 6 -z;...; a„„ b m ] 
is termed cube ( square in JK 2 ). We define similarly open cubes and 
half open cubes (half open to the right or to the left). 

We call net of closed intervals in R m any system of closed non- 
overlapping intervals that together cover the space R m . Similarly, 
by net of half open intervals, we mean a system of intervals half 
open on the same side, no two of which have common points, and 
whose sum covers R m . A sequence of nets (91*} (of closed or of half 
open intervals) is regular , if each interval of 9l*+> is contained in 
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an interval of 91* and if the characteristic numbers d(9 1*) (cf. 
Chap. II, p. 40) tend to 0 as k-> oo. Given a net of half open inter- 
vals, we clearly change it into a net of closed intervals by replacing 
the half open intervals by their closures. The same operation changes 
any regular sequence of nets of half open intervals into a regular 
sequence of nets of closed intervals. 

(2.1) Theorem. Given an enumerable system of hyperplanes Xk=a it 
where Jfc=l, 2 , ..., m, and j= 1 , 2 ,..., we cam, always construct a 
regular sequence {91*} of nets of cubes ( closed or half open) none of 
which has a face on the given hyperplanes. 

To see this, let b denote a positive number not of the form 
qaj/p where p and q are integers and j= 1, 2, .... Such a number b 
certainly exists, since the set of the numbers of the form qajfp is 
at most enumerable. This being so, for each positive integer Tc let 
us denote by 91* the net consisting of all the cubes half open to the 
left (p t b/2 k , (pi+ 1)5/2*; p 2 5/2*, (p 2 + 1)5/2*; ...; p m 5/2* (p m + 1)5/2*], 
where p \, p 2 , p m are arbitrary integers. The sequence of nets 
{91*} evidently fulfills the required conditions. 

Let us observe that, given a regular sequence {91*} of nets of 
half open [ closed ] intervals , every open set G is expressible as the 
sum of an enumerable system of intervals (91*) without common points 
[non-overlapping]. To see this, let be the set of intervals of 91 x that 
lie in G , and let 9Jl*+i, for each 1, be the set of intervals (91*+ 1 ) 
that lie in G but not in any of the intervals (9JI*). Since d(9l*)~>0 
as k oo, the enumerable system of intervals £ 9J1* covers the set G, 

k 

and the other conditions required are evidently satisfied also. 

On account of Theorem 2.1, we derive at once the following 
proposition which will often be useful to us in the course of this 
Chapter: 

(2.2) Theorem. Given a sequence of hyperplanes {Hi}, every open, 
set G is expressible as the sum of a sequence of half open cubes 
without common points [or of closed non-overlapping cubes] whose 
faces do not lie on any of the hyperplanes Hi. 

A set expressible as the sum of a finite number of intervals 
will be termed elementary figure , or simply, figure. Every sum of 
a finite number of figures is itself a figure, but this is not in general 
the case for the common part, or for the difference, of two figures. 
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We shall therefore define two operations similar to those of multi- 
plication and subtraction of sets, but which differ from the latter 
in that, when we perform them on figures, the result is again a figure. 
These operations will be denoted by © and 0 and are defined by 
the relations 

AOB=(Allj 5 and A 0 B = (I—Bf- 

The relation A©J5 = 0 means that the figures A and B do not 
overlap (cf. Chap. II, p. 40). 

Given an interval /=[«i, br, a>, 62 ; ...; a m , the number 
(bi — a\)-{b L — a 1 )-...-(h m — a,„) will be called volume of I ( length for 
m=l, area for m= 2), and denoted by L (I) or by |Z|. If 1— 0, by 
L(/)=|Z| we mean 0 also. When several spaces R m are considered 
simultaneously, we shall, to prevent any ambiguity, denote the 
volume of an interval T in Ii m by L m (7). We see at once that every 
figure B can be subdivided into a finite number of non-overlapping 
intervals. The sum of the volumes of these intervals is independent 
of the way in which we make this subdivision; it is termed volume 
(length, area) of B and denoted, just as in the case of intervals, 
by L (B) or by lit*!. 

§ 3. Functions of an interval. We shall say that F(l) is a 
function of an interval on a figure It [or in an open set <?], if F(I) is a 
finite real number uniquely defined for each interval I contained 
in R [or in (I |. To simplify the wording, we shall usually suppose 
that functions of an interval are defined in the whole space. 

A function of an interval F(I) will be said to be continuous 
on a figure B, if to each e >0 there corresponds an g > 0 such that 
|ij <»7 implies |F(/)|<e for every interval IQB. A function of 
an interval will be said to be continuous in an open set G, if it is 
continuous on every figure RQG. Finally, functions continuous in 
the whole space will simply be said to be continuous. 

The reader will have noticed that we use the terms “on” (or 
“over”) and “in” in slightly different senses. We may express the 
distinction as follows. Suppose that a certain property (P) of func- 
tions of a point, of an interval, or of a set has been defined on fig- 
ures. We then say that a function has this property in 
an open set G, if it has the property on every figure R<Z G. 
Further, if a function has the property (P) in the whole space, we 
say simply that it has the property (P). Thus, for instance, a function 
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of a set $(X) is additive (33) (cf. Chap. I, § 5 and Chap. II, § 2) 
in an open set G, if it is additive (33) on every figure RQG\ 
if fj is a measure (33) (ef. Chap. I, § 9), a function of a point, de- 
fined in the whole space, is said to be integrable (33, //), if it is 
integrable (33, fi) on every figure, and so on. 

We shall call oscillation O (F-, E) of a function of an interval F 
on a set E, the upper bound of the values \F(I)\ for intervals IQE. 
If D is an arbitrary set and R a figure, we shall denote by on(F;D) 
the lower bound of the numbers O (F; R-G), where G is any open 
set containing Z); the number o r(F;D) will be termed oscillation 
of F on R at the set D. Finally we shall say simply oscillation of F 
at D , and use the notation o(F; D), for the upper bound of the 
numbers Or(F; D) where R denotes any figure (or, what amounts 
to the same, interval or cube). 

In the sequel, D will usually be a hyperplane (a point in If v 
a straight line in f? 2 ) or else the boundary of a figure. In these cases 
we shall say that the function F is continuous, or discontinuous, 
at D on R, according as o K (F; D)=0 or o*(F; Z>)>0. Similarly, 
we shall say that F is continuous, or discontinuous, at D, according 
as o(F; 7>) = 0 or o(F;Z>)>0. 

(3.1) Theorem. In order that a function F of an interval be con- 
tinuous on a figure R [or in the whole space], it is necessary and suf- 
ficient that Or(E';D) = Q [or that o(F-,D) = 0] for every hyper- 
plane D. 

Proof. Since the condition is clearly necessary, let us suppose 
that the function F is not continuous on R. There is then a number 
f 0 ><) and a sequence of intervals (/ <n) — [a{ n) , M"*; ...; «?, 6™’]} 
contained in II and such that for n— 1, 2, ..., \F{I M )\ >*„, and that 
|Z ( " ) |-M). By the second of these conditions, we can extract from 
the sequence {/ <n) ), a subsequence {/*”**}, in such a manner that 
lim (b \ n ^ — «!"**) = 0 for a positive integer i—i 0 ^m. The sequences 

k 

(a|** ) ) A i 2 and 12 then have a common limit point a, 

and, denoting by D the hyperplane $i„=a, we see that every 
open set G^)I) contains an infinity of intervals I { "\ so that 
o n(F; l))^f 0 > <>. 
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§4. Functions of an interval that are additive and 
of bounded variation. A function of an interval F(I) is said to 
be additive on a figure R 0 [or in an open set G ], if F(I 1 +I 2 ) — 
=F(I 1 )-\-F(I 2 ) whenever I l} I 2 and I x -\-l 2 are intervals contained 
in R 0 [or in G ] and I lf I 2 are non-overlapping. A function additive 
in the whole space, is for instance the volume L(Z)=|7|. Just as 
in the case of the function L(7) (cf. § 2, p. 59), every additive 
function of an interval F(I) on a figure R 0 [or in an open set G] 
can be continued on all figures in R 0 [or in G] in such a manner 
that F(R 1 +R 2 )=F(R 1 )-\-F(R 2 ) for every pair of figures R t C R(, and 
R 2 CR 0 that do not overlap. In the sequel, we shall always sup- 
pose every additive function of an interval continued in this way 
on the figures. 

If F is an additive function of an interval on a figure R 0 , we 
shall term respectively upper and lower (relative) variations of F 
on R 0 the upper and lower bounds of F(R) for figures RC.Ru- We 
denote these variations by W (F; R 0 ) and W (F; R 0 ) respectively. 
Since every additive function vanishes on the empty set, we have 
W(F; R 0 )^O^W(F'-,R 0 ). The number W(F;i? 0 )+ W(F;i? 0 )|, clearly 
non-negative, will be called absolute variation of F on R 0 and de- 
noted by W (F-, R 0 ). If W(F; F 0 )<+cx 3, the function F is said 
to be of bounded variation on R 0 . In accordance with the convention 
of § 3, p. 59, an additive function of an interval in the whole space 
is of bounded variation, if it is of bounded variation on every figure. 

It is obvious that a function of bounded variation on a figure R 0 
is equally so on every figure contained in R 0 , and also that the sum, 
the difference, and, more generally, any linear combination of two 
additive functions of an interval that are of bounded variation on 
a figure, is itself of bounded variation on the same figure. 

An additive function whose values are of constant sign is 
termed monotone. A non-negative monotone additive function is 
also termed non-decreasing (for the same reason as in the case of 
non-negative additive functions of a set, cf. Chap. I, p. 8). Similarly, 
non-positive additive functions are also termed non-increasing. 
Every monotone additive function F of an interval on a figure R 0 
is clearly of bounded variation on R 0 . 

If a function is of bounded variation on a figure R 0 , its relative 
variations on R 0 are evidently finite. Conversely, if for an additive 
function F of an interval on a figure R 0 , one or other of the two 
relative variations is finite, then both are finite, and therefore the 
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absolute variation is finite. For, if W(F; i£ 0 )<+oo say, then as 
W (F-,R 0 ) is the lower bound of the numbers F(R)=F(R 0 )—F (R 0 QR), 
where R is any figure contained in R 0 , we find W(F; R 0 ) ^ 
>F(R 0 )— W(F-,R 0 )> — oo. Moreover, this last inequality may also be 
written in the form ft(ftJ^W(ft;ft 0 )+W(ft;ft 0 ). Here we replace F 
by — F to derive the opposite inequality and then finally the equality 
F(R 0 ) = W(ft; ft 0 )+W(ft; ft 0 ). Hence any additive function of bounded 
variation is the sum of its two relative variations on any figure for 
which it is defined. This decomposition is termed the Jordan de- 
composition of an additive function of bounded variation, and is 
similar to the Jordan decomposition of an additive function of 
a set (Chap. I, § 6). 

If F is an additive function of bounded variation of an inter- 
val on a figure R 0 , the three monotone functions defined for every 
figure R(2R 0 by the relations 

W(R)=:W(F ; ft), W 1 (R) = W(F; ft) and W 2 (ft)-W(ft; ft), 

are likewise additive on ft 0 and are termed, respectively, absolute , 
upper , and lower variations of F. The first two are non-negative 
and the third non-positive. It therefore follows from the Jordan 
decomposition that every additive function of bounded variation 
on a figure is, on this figure, the difference of two non-decreasing 
functions. The converse is obvious. 

We shall now prove some elementary theorems concerning 
continuity properties of functions of bounded variation. 

(4.1) Theorem. If F is an additive function of an interval , 
of bounded variation on a figure ft 0 , (i) the series JV o R) (F-, D n ) con - 

n 

verges for every sequence ( D n ) of hyperplanes distinct from one another , 
and (ii) there is at most an enumerable infinity of hyperplanes D such 
that o K„(ft; D) > 0. 

Proof. In the proof of (i) we may clearly suppose that the 
hyperplanes D n are orthogonal to the same axis. Consider now the 
first k of these D n . We can associate with them, k non-overlapping 
intervals Ii, I 2 , ..., /*, contained in ft 0 and such that o * o (F; Z> n X 

^\F(I n )\+l/k 2 for n= 1,2, Hence 2o Ki> (F-,D n )^Z\F(I n )\+llk^ 

n ~ 1 n ~ 1 

^ W(ft; ft 0 ) f 1/kj and, since k is an arbitrary positive integer, 

oo 

y; o* 0 (F; D„) ^ W(f ; R 0 ) < + oo. 

/I-l 
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To establish (ii), suppose that there is a non-enumerable in- 
finity of hyperplanes D such that o Kq (F-, Z))>0. There would then 
be a positive number t, such that o Fo (F-,D)~>e for an infinity, 
(which would even be non-enumerable) of hyperplanes. But this 
clearly contradicts part (i) which has just been proved. 

We obtain at once from Theorem 4.1 the following 

(4.2) Theorem. For each additive function of an interval of 
hounded variation, there is at most an enumerable infinity of hyper- 
planes of discontinuity. 

(4.3) Theorem. If F is an additive function of cm interval of 
hounded variation on a figure R 0 and we write W(I) = W (F; 1), 
the relations o Ro (F', D) = 0 and o Ko (Wj D)=0 are equivalent for 
every hyperplane D. 

Proof. Suppose, if possible, that 

(4.4) o* 0 (F;D)=0 and (4.5) o *„(W;D)>e 

for a hyperplane D and a number e > 0. We shall show that it is 
then possible to define a sequence of figures {R n }n= non-over- 
lapping, contained in R 0 , and such that 

(4.6) R n D= 0 and (4.7) \F(R n )\>e/2. 

To see this, suppose defined k non-overlapping figures Rt, Rz , .... Rk 
contained in R 0 , and let (4.6) and (4.7) hold for n=l, 2, ..., k. 
On account of (4.5) there is then an interval I C^o 8UC ^ that I-Rn=0 
for »=1, 2, ..., k, and such that W(Fj I)=W(I) > t. Hence, there 
exists a figure RQI such that |P(i?)| > W(F; /)/2 > f/2. Moreover 
since (4.4) asserts that F is continuous on R 0 at D, we may sup- 
pose that R-D= 0. But if we now choose Rk+i — R, we see that 
the figure i?*+i does not overlap any of the figures R„ for 
and that (4.6) and (4.7) continue to hold for n—k-\-l. 
Having obtained our sequence { R „ }, we conclude from (4.7) 
that W (F; R 0 )^ y \F (R„)\=oo, and this contradicts our hypotheses. 

n 

The conditions (4.4) and (4.5) are thus incompatible, i. e. (4.4) 
implies o Ro (W-, D)~0. And since the converse is obvious, this 
completes the proof. 

From Theorems 4.3 and 3.1, we obtain at once the following 

(4.8) Theorem. In order that an additive function of bounded 
variation on a figure R 0 be continuous on R 0 , it is necessary and suf- 
ficient that its three variations be so. 



64 


CHAPTER III. Functions of bounded variation. 


$ 5. Lebesgue-Stieltjes integral. Lebesgue integral 
and measure. We need hardly point out the analogy between 
additive functions of bounded variation of an interval and additive 
functions of sets. This analogy will be made clearer and deeper in 
the present §, by associating a function U* of a set with each addi- 
tive function U of bounded variation of an interval. In order to 
simplify the wording, we shall suppose that the functions of an 
interval are defined in the whole space. 

Suppose given in the first place, a non-negative additive 
function U of an interval; we then denote for any set E, by U*(E) 
the lower bound of the sums X U (f*)> where {/*} is an arbitrary 

A 

sequence of intervals such that For an arbitrary additive 


function U of bounded variation, with the upper and lower varia- 
tions W x and W 2 , we denote by Wf and ( — W 2 )* the functions 
of a set that correspond to the non-negative functions and — W 2 , 
and we write, by definition, U*—W* — ( — W 2 )*. The function U * 
is thus defined for all sets and is finite for bounded sets. 


When U is non-negative, U* is an outer measure in the sense 
of Carath^odory, i. e. fulfills the three conditions (C,), (C 2 ) and (C 3 ) 
of Chap. II, § 4. Condition (C 3 ) is the only one requiring proof, the 
other two are obvious. Let therefore A and B be any two sets whose 
distance does not vanish, and let e be a positive number. There 
is then a sequence {/„} of intervals such that A+BC.2 It, and 

n 

2]U(I n )^.U*(A+B)At. We may clearly suppose that the intervals 

n 


of the sequence have diameters less than q ( A , B), i. e. that none 
of them contains both points of A and points of B. We then 
have U*(A)+U*(B)^2U{I„)^U*(A+B)+e. This gives the in- 

n 


equality U*(A)+ U*(B)^.U*(A+B) and establishes condition (C 3 ). 

The function U*, determined by a non-negative function of 
an interval, itself determines, since it is an outer Carath6odory 
measure (cf. Chap. II, § 4, p. 46), the class 2u- of the sets measurable 
with respect to V * and the process of integration ( U*). To simplify 
the notation, we shall omit the asterisk and write simply 2u for 
fit/*, integral (U) for integral (U*), measure (U) of a set in- 


stead of measure 


(17*), [fdU instead of ffdU*, 


and so on. 
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185] 

This slight change of notation cannot cause any confusion, since 
the measure U* is uniquely determined by the function of an in- 
terval U. 

When 17 is a general additive function of an interval, of bounded 
variation, we shall understand by Hu the common part of the classes 
2 iv, and 2- w ., where and W 2 denote respectively the upper and 
lower variations of U. A function of a point f(x) will be termed 
integrable (U) on a set E, if f(x) is integrable (W x ) and ( — W 2 ) si- 
multaneously; by its integral ( U) over E we shall mean the number 

J fdW 1 — fjd( — W 2 ), and we write it JfdU as in the case of 

*• n E 

a non-negative function U. This integration with respect to an 
additive function of bounded variation of an interval 
is called Lebesgue-Stieltjes integration. In the case of the integra- 

b 

tion over an interval I — [«, b ] in /?,, we frequently write f fdU 

a 

for J fd U. 

i 

When the function U is continuous, every indefinite integral ( U) 
vanishes, together with the function U*, on the boundary of any 
figure. Consequently, an indefinite integral with respect to a 
continuous function U of bounded variation of an interval is 
additive not only as function of a set (2 u) but also as function 
of an interval. 

The most important case is that in which the given function 
of an interval P is the special function L (cf. § 2, p. 59) that denotes 
the volume of an interval. The outer measure (L) is also termed 
outer Lebesgue measure, and the integral (L), Lebesgue integral, 
while functions integrable (L) are often called, as originally, by 
Lebesgue, summable. The class of sets 2i. will be denoted simply 
by 2. The outer measure (L) of an arbitrary set E is written meas,J?, 
and meas E without the suffix when E is measurable (2). We shall 
also denote this outer measure by \E\ or by L(L’) (or sometimes 
by L „,{E) in tt m ), thus extending to arbitrary sets the notation 
adopted for all figures 11, since for the latter, as we shall see 
(cf. Th. 6.2), the measure (L) coincides with the values L(ft)=|iii|. 
Finally, owing to the special part played by Lebesgue measure 
in the theory of integration and derivation, the terms “measure 
of a set”, “measurable set”, “measurable function”, and 
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so on, will, in the sequel, be understood in the Lebesgue sense, 
whenever another sense has not been explicitly assigned to them. 
We also modify slightly the integral notation for a Lebesgue 


integral; and instead of Jf(x)dL(x) we write Jf(x)dx, or else 


J J' •••J ' / (*Ei> •••) &m) dx{ dxi ... dx n 


when we wish to indicate the 


E 

number of dimensions of the space U m under consideration. This 
brings us back to the classical notation. 

A special part, similar to that of Lebesgue measure in the 
theory of the integral, is played by Borel sets in the theory of ad- 
ditive classes of sets. In the first place, it follows from Theorem 7.4, 
Chap. II, that every class 2u, where U is an additive function of 
bounded variation of an interval, contains all the sets (93). In the 
sequel, we shall agree that additive functions of a set will 
always mean functions additive (93), unless there is explicit reference 
to another additive class of sets. Similarly, additive functions of 
a set that are absolutely continuous (93, L) or singular (93, L), will 
simply be called absolutely continuous or singular. In point of fact, 
Theorem 6.6 below, which asserts that every set measurable (2) 
is the sum of a set (93) and a set of zero measure (L), will show that 
every additive function of a set, absolutely continuous (93, L), can 
be extended in a unique manner to all sets (2) so as to remain ab- 
solutely continuous (2, L). 


The special rdle of the measure (L) and of the sets (93) showed itself already 
during the growth of the theory. Lebesgue measure was the starting point for 
further extensions of the notions of measure and integral, whereas the Borel 
sets were the origin of general theories of additive classes and functions. The 
sets (93) were introduced, with measure (L) defined for them, by E. Borel [I, 
p. 46 — 60] in 1898. But it was not until some years later that H. Lebesgue [1; I], 
by simplifying and extending the definition of measure (L) to all sets (£), made 
clear the importance of this measure for the theory of integration and especially 
for that of derivation of functions. Vide E. Borel [1] and H. Lebesgue [6]. 

We have already seen in § 1 of this book, how, by an apparently very slight 
modification of the classical definition of Riemann, we obtain the Lebesgue in- 
tegral. A similar remark may be made with regard to the relationship of Le- 
besgue measure to the earlier measure of Peano- Jordan. The outer measure 
of Peano-Jordan for a bounded set E is the lower bound of the numbers £ |7/i| 

n 

where {I n } is any finite system of intervals covering E . Lebesgue’ s happy idea 
was to replace in this definition, the finite systems of intervals by enumer- 
able ones. 
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We have given in the text a more general form to Lebesgue’s definition, 
relative to an arbitrary non-negative function of an interval. This relativizing 
of Lebesgue measure is due to J. Radon [1] and to Ch. J. de la Valine -Poussin 
[1; I], The parallel extension of the Lebesgue integral is also due to J. Radon. 
In the text we have termed it Lebesgue -Stieltjes integral; it is sometimes also 
termed Lebesgue- Radon integral or Radon integral. For a systematic 
exposition of the properties of this integral, vide H. Lebesgue [II, Chap. XI]. 
A particularly interesting generalization of the Lebesgue integral, of the Stieltjes 
type, has been given by N. Bary and D. Menchoff[l]; it differs considerably 
from the other generalizations of this type. Finally, for an account of the 
Riemann -Stieltjes integral (which we shall not discuss in this volume) vide 
W. H. Young [2], S. Pollard [1], R. C. Young [1], M. Fr^chet [5] and 
G. Fichtenholz [2]. 

It was again J. Radon [I, p. 1] who pointed out the importance of the 
Lebesgue -Stieltjes integral for certain classical parts of Analysis, particularly 
for potential theory. The modern progress of this theory, which is bound up with 
the theory of subharmonic functions, has shown up still further the fruitfulness 
of the Lebesgue -Stieltjes integral in this branch of Analysis (cf. the memoirs 
of F. Riesz [4] and G. C. Evans [1]). 


§ 6. Measure defined by a non-negative additive 
function of an interval. In this §, U will denote a fixed non- 
negative additive function of an interval. In the preceding § we 
made correspond to any such a function U, an outer Carath 6 o- 
dory measure U*. Besides the properties established in Chap. II 
for all Carath4odory measures, the function of a set U* possesses a 
number of elementary properties of a more special kind which we 
shall investigate in this §. 

(6.1) Lemma. If D denotes a hyperplane or a degenerate interval , 
the relation o(t/;D) = 0 implies U*(D) = 0. 

Proof. Since every hyperplane is the sum of a sequence of 
degenerate intervals (cf. § 2, p. 57), it is enough to prove the lemma 
in the case in which D is a degenerate interval [«i, fti; a lf ...; a m , b m ]. 

Let If be a cube containing D in its interior, 0 an arbitrary 
open set such that DCG, and let 

D f = [ai — f, ai — f, 62 +t; ...; a m — f, b m -\-e], 

where e is any positive number, sufficiently small to ensure 
that D,(2R G. Since D e is then an ordinary closed interval con- 
taining D in its interior, we find U*(D) (D t ) ^0( t/; RG), 
whence t/*(i)Ko R ([/;D) = 0 . 
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(6.2) Theorem, . For every figure R we have 

(6.3) U*(R°) < U ( J2)< V(R), 

and, if the oscillation of U at the boundary of R vanishes, 

(6.4) U*(R°)=U(R)=U*(R). 

In particular therefore, if TJ is a continuous function, the equal- 
ity (6.4) holds for every figure R. 

Proof. In virtue of Theorems 2.2 and 4.2 the set R° is ex- 
pressible as the sum of a sequence of non -overlapping cubes {/*} 
such that the oscillation of V vanishes at all faces of all the l k . 
Hence, by the preceding lemma U*(R°)=2JU*(1°m), and since 

A 

n n 

U(R)^s? 2j U (Ik)t^ Z U*(I k ) for each positive integer n, we get 

£=1 k=-\ 

U*(R ° )< U(R). 

To establish that U (R)<^U*(R), it is enough to show. that 
U (R)^ZU (Ik) for every sequence of intervals {/*} such that R(ZZlh- 

k k 

Now, if {I/,} is such a sequence, we have, by the well-known covering 

\ 

theorem of Borel-Lebesgue, R(ZZli for some sufficiently large 

A— 1 

N N 

value of N. Hence U (R)^ZU (ROl k )^. ZU (7*). 

A- 1 A — ] 

Finally, denoting by B the boundary of R, let us suppose that 
o(U-,B)~ 0. It then follows from Lemma 6.1 that U*(B)= 0, so 
that U*(R) — U*(R' J ), and the equality (6.4) follows at once from (6.3). 

(6.5) Theorem. Given an arbitrary set E and any positive e, there 
is (i) an open set G such that E(^G and U*(G)^.U*(E)+e, (ii) a set 
He(S>i such that ECH and U*(H)=U*(E). 

Proof, re (i). There exists for each e>0, a sequence of inter- 
vals { I„ } such that E(^Xln and that ZU (I n )<^U*(E)-\ f. Hence, 

n n 

writing G—Zl„ , we find, on account of condition (C 2 ) of CaratModory 

n 

(Chap. II, p. 43) and Theorem 6.2, that U*(G)^ZU*(In)^ZU(I„)< ; 
^V*(E)-\-e. 
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re (ii). Let us make correspond to E, for each positive integer n, 
an open set O n containing E and such that U*{G„)*C.U*(E) + lln; 
this is always possible by (i). The set H=f]G„ clearly fulfills our 

n 

requirements and this completes the proof. 

Every set (®<t) is of course measurable (2u). Hence it follows 
at once from Theorem 6.5 that every set is regular (cf. Chap. II, § 6) 
with respect to the outer measure U*, and therefore that this 
measure is itself regular. 

(6.6) Theorem. Each of the following conditions is necessary 
and sufficient for a set E to be measurable (2 u): 

(i) for every e>0 there is an open set G Cl E such that 
U*(G—E)^ t; 

(ii) there is a set (©<>) containing E and differing from E at most 
by a set of measure ( U ) zero', 

(iii) for every e>0 there is a closed set FC~E stick that 
U*(E—F)^ f; 

(iv) there is a set (<$„) contained in E and differing from E 
at most by a set of measure ( U) zero. 

Proof. We shall first prove all these conditions necessary. 
Let E be a set measurable (2c/) and e a positive number. We begin 
by representing E as the sum of a sequence {E n )„_ 1,2,. of sets measur- 
able (2 u) of finite measure; we may write for instance, E„=Z?-S(0 ;n). 
This being so, we associate with each set E„, in accordance with The- 
orem 6.5, an open set G„CE„ such that U*(G„)^U*(E„)-{-fl2". 
Hence, the sets E„ being measurable (2 u), we have U*(G n — E,,)^. f/2" 
for every n, and if we write G=£Gn, we find E C G and U*(G — U)^ 

n 

^XU*(G„ — S„Kc, and this proves condition (i) necessary. 

it 

To prove the necessity of condition (ii), we attach to the given 
set E measurable (2 y) a sequence {Q„\ of open sets such that EQQ„ 
and U*(Q„ — E)^l/n for each n. Writing // =llQ„, we see that 

//*©„, EQH and U*(H—E)= 0. 

Finally, we observe that for any set A, the relation ACCE 
implies VAQE and E — CA—A — CE; further, if A is a set (©) 
or (©,)), the set C A is a set (5) or ($„) respectively. Hence every 
set E measurable (2 u) fulfills conditions (iii) and (iv), since, by the 
results just proved, its complement CE fulfills conditions (i) and (ii). 
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The sufficiency of conditions (ii) and (iv) is evident, since 
sets of measure ( U) zero, and sets (©a) or ($„), are always measur- 
able (2u). 

To establish the suffic'ency of conditions (i) and (iii), we 
need only observe that they imply respectively conditions (ii) 
and (iv). Thus, for instance, if (iii) holds, there is for each positive 
integer n a closed set F„QE such that U*(E — F„)^;i/n. The set 
P—^F„ is therefore a set (fto) contained in E and such that 

n 

U*(E — P)=0. 

From Theorem 6.6 it follows in particular that the general 
form of a set measurable (2u) is B+N, where B is a set measurable 
(©) and N a set of measure (U) zero. In other words, 2u is the 
smallest additive class containing the Borel sets and the sets of 
measure ( U ) zero. It follows that 2 i02 whenever the function of 
an interval V is absolutely continuous (vide §12). 

(6.7) Theorem. For any set E there is a set He(&A containing E 
and such that 

(6.8) U*(H-X)=U*(E- X) for every set X measurable (2 u). 

Proof. It is enough to show that there is a set H ~^)E measurable 
(2 u) for which (6.8) holds. For, by Theorem 6.6 we can always en- 
close such a set H in a set (©a) differing from it by a set of measure 
(V) zero. 

Let us represent E as limit of an ascending sequence (E n ) 
of bounded sets, which are therefore of finite outer measure (I/); 
and let us associate, as we may by Theorem 6.5, with each E„ a set 
H n e($>d such that E„ C#n and U*(H n )=U*(E n ). Then, for every 
set X measurable (2c), 

U*(H„.X)=U*(H n )-^ U*(H n .CX)^V*(E n (— U*(E n -CX)^U'(E n .X); 
from which, writing #=liminf we deduce by means of The- 

n 

orem 9.1 of Chap. I, that 

U*(H-X) < Urn inf U*(H n -X ) < lim U*(E n -X) < V*(E-X), 

n n 

and this implies (6.8), since H = lim inf H n 3 lim E n = E. 

n n 

In the theorems of this § we have supposed the function U of 
an interval to be non-negative. But, by slight changes in the wording, 
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the theorems can easily be extended to arbitrary functions of bounded 
variation. As an example we mention the following theorem which 
corresponds to Theorem 6.5: 

(6.9) Theorem . If F is an additive function of bounded variation 

of an interval , then for any bounded set E and any e>0 there is an 
open set G^)E such that | F*(X) — F*(E) for every bounded set X 

satisfying the condition EQX(^G. 

Proof. Denoting by W l and W 2 two functions of an interval 
that are respectively the upper and the lower variation of F, we 
can, by Theorem 6.5, enclose E in each of two open sets G 1 and G 2 
such that W*(G 1 )^W*(E)+£ and W*(G 2 )^W%(E) — e . Therefore, 
writing G—G l G 2 we have EQ (?; and for any bounded set X 
such that E(ZX(2G, we find 0^W*(X) — W*(E)^e and 
0^W$(E) — W*(X) whence by subtraction \F*(X) — F*(F)| 

Let us still prove a theorem which allows us to regard all non- 
negative additive functions of a set in It m as determined by non- 
negative additive functions of an interval. We recall that, according 
to the conventions of § 3, p. 59 and § 5, p. 66, we always mean by 
additive functions of a set, functions of a set that are additive (93) 
on every figure. 

(6.10) Theorem. Gwen any non-negative additive function 0 of 
a set , there is always a non-negative additive function F of an interval 
such that 0(X)=F*(X) for every bounded set X measurable (93). 

Proof. Let us denote for each interval /=[«], b\\ ...; a mj 6 m ], 
by I the interval (aj, b\\ ...; a m , b m ] half open to the left, and let 
us define the non-negative additive function of an interval by writing 
F(l) — r l>( I) for every interval I. 

This being so, we observe that any bounded open set G can 
be expressed (cf. Theorems 2.2 and 4.2) as the sum of a sequence 
{I n } of non-overlapping intervals at whose faces the oscillation of 
F vanishes; and therefore by Theorem 6.2, 0(G)— £0(1 n )=£F(I n ) — 

n n 

=£F*(I n )^F*(G). Thus the equation 0(X)—F*(X) holds whenever A 

n 

is a bounded open set, and therefore also whenever X is a bounded 
set (©<0, since the latter is expressible as the limit of a descending 
sequence of bounded open sets. It follows further that 0(X)~ 
^F*(A)=() for every bounded set X of measure (F) zero, since, 
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by Theorem 6.6, such a set X can be enclosed in a bounded set (©<*) 
ot measure (F) zero. This completes the proof, since every bounded 
set (®) is, by Theorem 6.6, the difference of a bounded set (©<5) and 
of a set of measure (F) zero. 

The proof of Theorem 6.10 could also be attached to the following general 
theorem concerning functions additive ($3) defined on any metrical space M: 
if two such functions coincide for every open set , they are identical for all sets (S). 
This theorem is easily proved. 

§ 7* Theorems of Lusin and Vitall-Carath^odory. We 

shall establish in this § two theorems concerning the approximation 
to measurable functions by continuous functions and by semi- 
continuous functions. As in the preceding §, U will stand for a non- 
negative additive function of an interval, fixed in any manner for 
the space Il m . 

(7.1) Lusin’s Theorem . In order that a function f(x ), finite 
on a set 25, be measurable (2 u) on E , it is necessary and sufficient thal 
for every € >0, there exists a closed setF(2E such that U*(E — E)<f, and 
on which f(x) is continuous . 

Proof. To show the condition necessary, we suppose f(x) 
finite and measurable (2 u) on E , and we deal first with two partic- 
ular cases: 

(i) f(x) is a simple function on E. The set E is, in this 
case, the sum of a finite sequence E h E 2 , ..., E„ of sets measurable 
(2 u) no two of which have common points, such that f(x) is con- 
stant on each of these sets. By Theorem 6.6 there exists, for each 
set Eij a closed set FtCFi such that U*(E t — E f )<f/n. Writing 

n 

F=£F t , we then have FQE and U*(E — f)<t, and moreover 

1 

the function f(x) is clearly continuous on F. 

(ii) E is a set of finite measure (U). In this case, by 
Theorem 7.4, Chap. I, (applied separately to the non- negative and 
to the non-positive parts of f(x)), there is a sequence {f„(x)), ,=i >2 , ... 
of simple functions, finite and measurable (2 u), that converges on E 
to /( x). By Egoroff’s Theorem (Chap. I, Th. 9.6), this sequence 
converges uniformly on a set PQE, measurable (2{/) and such that 
V*(E — P)<f/2. This set P may further be supposed to be closed, 
on account of Theorem 6.6. Finally, by (i) we can attach to each 
function f„(x), a closed set P„C.E such that U*(E — P„)<f/2 n+l , 



Theorems of Lusin and Vitali-Carath^odory. 


73 


[§7] 

and on which /„(*) is continuous. Hence, writing F=P-fl P,„ 

It 

we get U*(E—F)^U*(E—P) + ZU*(E—P n )^t-, and moreover, all 

n 

the /„( x), and therefore also the function f(x) — lim /„(#), are con- 

n 

tinuous on F, a set which is evidently closed. 

We now come to the general case where E is any set measur- 
able (Ct/). Let E„—E-(8 n — $„_i), where /S o =0 and $„=S(0;») 
for n^l. By (ii), there exists, for each n^l, a closed set Q„(ZE n 
such that U*(E„ — Q„)<«/2", and on which f(x) is continuous. 

Writing F—^Q n , the set F is closed, we have U*(E — 

n — 1 

<ZU*(E n —Q„) <f, and f(x) is continuous on F. 

n 

The proof of the necessity of the condition is thus complete. 
Let us now suppose, conversely, that the condition is satisfied. 
The set E is then expressible as the sum of a set N of measure ( U) 
zero and of a sequence of closed sets on each of which f(x) is 
continuous. The function / is thus measurable (2*/) on N and on 
each of the sets F n (cf. Chap. II, Th. 7.6), and therefore on the whole 
set E. 

For the various proofs of Lusin’s theorem, vide N. Lusin [1J, W. Sier- 
pihski [6] and L. W. Cohen [1]. 

(7.2) Lemma . Given a function f(x ), measurable (2 u) and non- 
negative in the space JR mj there exists , for each f>0, a lower semi- 
continuous function h(x) such that 

(7.3) h(x)^ f{x) at each point x , 
and 

(7.4) J[h(x) — f(x)]dU(x)^e 

Hm 

(where, in accordance with the convention of Chap. I, p. 6, the 
difference h(x ) — ■ f(x ) is to be understood to vanish at any point x 
for which h(x)=f(x)=+ oo). 

Proof, (i) First suppose that f(x) is bounded and vanishes 
outside a bounded set E measurable (fit/). Let t]=e/[l + U*(E)]. 
write Ek=E[xe E; (k — l)r]^f(x)<.kt]] for Jfc=l, 2, ... and we 

JC 

associate with each set 2?* an open set Ok D-®* such that 
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( 7 . 5 ) U*(0„ — 

Further, denoting by c k (x) the characteristic function of G k , we 

oo 

write h(x)=£kr]-c k (x). Since each function c k (x) is evidently lower 

A=1 

semi-con tinuous, the function h(x) is so too. We also observe that 
h(x) fulfills condition (7.3). On the other hand 

oo oo 

f h(x)d U(x)— £kr]fc k (x)dU(x)= ^krj-U*(G k )= 

R m *=» K m *=« 

OO oo 00 

= ]>(k— l)f]-U*(E k ) + JS V -U*(E k ) + 2j kf l-U*(G k —E k ), 

k~\ k=\ ft - 1 


whence, by (7.5), we obtain 


fh(x)dV(x )< Jf(x)dU(x) + *i-U*(E)+ri^ ff(x)dU(x) + e. 


From this, remembering that f(x) is integrable on R m , (7.4) follows 
at once. 

(ii) We now pass to the general case and represent firstly f(x) 

oo 

in the form f(x)=2f„(x), where the f„(x) are bounded non-negative 

n— 1 

functions measurable (2 v ), each of which vanishes outside a bounded 
set. We may do this, for instance, by writing f n (x) = s n {x) — s„-\(x) where 


s„{x) = 


f(x) for Q(0,x)<n and f(x)^.n, 

n for p(0, x)<.n and f(x)">n, 

0 for p(0, x)^n 


n= 0, 1, 2, ... 


By what has been proved in (i), there exists for each function f„(x ) 
a lower semi-continuous function h„(x) such that h„(x)^f n (x) at 
every point x, and that / [h„(x) — f n (x)] dV (x) <f/2". The function 

lim 

OO 

h(x)=2 h n (x) is then evidently lower semi-continuous and fulfills con- 

n~l 

oo 

dition (7.3). Finally f[h(x) — f(x)]dU(x)^ ^ J[h„(x)—f„(x)]dU(x)^.e t 
Km n=l Km 

and this completes the proof. 



Theorems of Lusin and Vitali-Carath6odory. 


75 


l§7] 

(7.6) Theorem of Vitali-Carathtodory . Given a function f(x) 
measurable (2u) m the space R mj there exist two monotone sequences 
of functions {?„(#)) and { u n (x )} for which the following conditions are 
satisfied: 

(i) the functions l n are lower semi-continuous and the functions 
u n are upper semi-continuous , 

(ii) each of the functions l n is bounded below and each of the 
functions u n is bounded above , 

(iii) the sequence (Z„) is non-increasing and the sequence ( u n ) is 
non-decreasing , 

(iv) l n {%)^f(x)^u n (x) for every x , 

(v) \iml n (x)=f(x) = \imu n (xy almost everywhere ( U ), 

n n 

(vi) on every set E on which f(x) is integrable (17), so are the 
functions l n {%) and u n (x) and we have 

lim f l„(x) dt/(a;)=lim fu n (x) dU(x)= ff(x) dU(x). 
n k " k k 

Proof. By expressing the function f(x) as the sum of its 

o 

non-negative and non-positive parts f(x) and f(x) (Chap. I, § 7), 

o 

we may suppose that f(x) is of constant sign, say non-negative. 
By the preceding lemma, we can associate with f(x) a sequence 
of lower semi-continuous functions {h„(x))„^, i, 2 ,.. such that h n (x)^f(x) 
for every x and 

(7.7) ]im[[h n (x)—f(x)]dU(x) = 0. 

n R 

Writing l n (x ) — min [M#), •••> h„(x)] we therefore obtain a non- 

increasing sequence of lower semi-continuous functions ((„(#)} 
that evidently fulfills conditions (i), (ii), (iii) and (iv); moreover, 
it follows from (7.7) that lim / [ l n (x ) — f(x)]dU (x) = 0, and hence 

" Rm 

that the functions l n (x) fulfill also conditions (v) and (vi). 

In order now to define the sequence [u„(x)}, we attach to the 
function l/f(x) a non-increasing sequence of lower semi-continuous 
functions {g n (%)} such that lim g„(x) = l/f(x) almost everywhere ( U). 

n 

Such a sequence certainly exists by what has just been proved. 
The functions l/g„(x) then form a non-decreasing sequence of upper 
semi-continuous functions, that converges almost everywhere (U) 
to f(x). If we now write u n (x) = l/g„(x) when l/g n (x)^.n, and 
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u„(x)=n when 1 /</„(#)> w, we obtain a sequence {u n (xj) of bounded 
functions with the same properties, which therefore satisfies con- 
ditions (i — v). Finally, since the functions u n (x) are non-negative, 
we can apply Lebesgue’s Theorem (Chap. I, Th. 12.6) to derive 
from (iii) and (v) that lim / u n (x)dU (x)= f f(x)dU (x) on every 

n E K 

set E measurable (2t/), and this implies (vi). 

Conditions (i) and (v) of Theorem 7.6 imply that every function measur- 
able (2&) is almost everywhere (U) the limit of a convergent sequence (with 
finite or infinite limit) of semi-continuous functions, and thus coincides almost 
everywhere (U) with a function of the second class of Baire. This result, due 
to G. V itali [2] (cf. also W. Sierpinski [6]) was completed by C. Carath^odory 
[I, p. 406], who established for every measurable function f(x) the existence 
of two sequences of functions fulfilling conditions (i) — (v). Condition (vi), which 
includes, as we shall see later, the theorem of de la Valle© Poussin and Perron 
on the existence, for summable functions, of majorant and minorant 
functions, has been added here because its proof is naturally related to those 
of conditions (i — v). 

There is an obvious analogy between the property of measurable functions 
expressed by the theorem of Vitali-Caratheodory, and the properties of meas- 
urable sets stated in conditions (i) and (iii) of Theorem 6.6. By taking into ac- 
count the geometrical definition of the integral (cf. below § 10), we might even 
base the proof of Theorem 7.6 directly on Theorem 6.6 (vide the first ed. of this 
book, pp. 88 — 91). 

§ 8* Theorem of Fubini. Given two Euclidean spaces R p 
and R q , if x—(a\, a 2 , a p ) and y—(a p +\, a p + 2 , ...» a P + q ) are two 
points situated respectively in these two spaces, we shall denote 
by (x, y) the point (a ( , a 2 , a p+q ) in the space R,,+ q ■ If X and ¥ 
are two sets situated respectively in the spaces R p and R q , we shall 
denote by Ixl the set of all points (x, y) in R p + q such that xeX 
and ye Y. In particular, if X and Y are two intervals — closed, open, 
or half open on the same side — XxY also is an interval in R p+q , 
which is closed, open, or half open on the same side as X and Y. 
Every interval I=[a { , b r, a p+q , b p+q ] can evidently be expressed — 
and in a unique manner — in the form I\x I 2 where I 1 and I 2 
are intervals in R p and R q respectively; we merely have to write 
Ji=[di, b\j <ip, bp] and I 2 =[a p +i, b p + 1 ; n p + q i 

Given two additive functions of an interval, U and V, in the 
spaces R p and R q respectively, we determine a function of an inter- 
val T in R p + q by writing T(/ 1 x/ 2 ) = f/(/ 1 )- F(7 2 ) for each pair of 
intervals I\C.R P and hQRq. The function T thus defined, clearly 
additive when U and V are, will be denoted by UV. In particular, 
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we see easily that L P+<7 =L P L 7 , where L PJ L q and L p+<7 denote 
the volume in the spaces R PJ R q and R p + q respectively (cf . § 2, p. 59). 

It is known since Cauchy that, if I x and I 2 are respectively 
two intervals in the spaces R p and R q} integration of any continuous 
function over the interval I\Xl 2 C.R P + q may be reduced to two 
successive integrations over the intervals I I and Z 2 . By repeating 
the process, any integral of a continuous function on an ra-dimensional 
interval may be reduced to m successive integrations on linear 
intervals in R v This classical theorem was extended by H. Lebes- 
gue [1] to functions measurable (2) that are bounded, and then by 
G. Fubini [1] (cf. also L. Tonelli [2]) to all functions integrable (L), 
whether bounded or not. We shall state this result in the follow- 
ing form: 


(8.1) Fubini’8 Theorem,* Suppose given two non-negative ad- 
ditive functions U and V of an interval in the spaces R p and R q 
respectively , and let f(x,y) be a non-negative function measurable (2 uv) 
in R p + q . Then 

(h) V) i s a function of x , measurable (2 u) in R P for every 
yeRqj except at most a set of measure ( V) zero , 

(i 2 ) f( x i V) a function of y , measurable (2v) in R q for every 
xeR p , except at most a set of measure (U) zero , 


(ii) f f(x, y) dUV(x, y) = f [ j f(x, y) dU(x)] dV(y) = 

**p+q ^ R P 

= [ [ff(x, y) dV(y)]dU(x). 

K K o 

Proof. Let us write for short, T=UV. By symmetry, it is 
enough to show that every non-negative function f(x,y) measurable 
(2 t) in Itp+q fulfills condition (i x ) and also the relation 

(8.2) ff(x, y) dT(x, y)=/ [/ f(x, y) dU(x)] dV(y). 

^ p-j-g ttp 

For brevity, we shall say that a function f(%,y) in R P + q has the 
property (F), if it is non-negative and measurable (2 r) in R P + qj and 
if it fulfills condition (i x ) and the relation (8.2). For the sake of 
clearness, the reasoning that follows is divided into a several 
auxiliary propositions. 
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(8.3) The sum of two functions with the property ( F ), and the 
limit of any non-decreasing sequence of such functions , have the prop - 
erty (F). Also , the difference of two functions with the property ( F ), 
has the property (F), provided that it is non-negative and that one at 
least of the given functions is finite and integrable (T) on the space 

Hp+q. 

For the sum, and for the difference, of two functions, the 
statement is obvious. Let therefore {h n (x,y)} be a non-decreasing 
sequence of functions in H p ^ q having the property (F), and let 
h(x, y) = lim h n (x , y ). The definite integrals f h n (x , y)dU(x) exist, 

n 

p 

and constitute a non-decreasing sequence, for every y e li q except 
at most those of a set of measure ( V ) zero. Consequently, by 
Lebesgue’s theorem on integration of monotone sequences of functions: 


R 


fh(x,y)dT(x,y)= lim f h n (x, y) dT(x, y)=lim f | / h„(x,y) dU(x )] dV(y)= 
"k. n k k 

P+Q Q P 

= f [hm fh„(x, y) dU(x) 1 dV(y) = f [f h(x, y) dU(x) ] dV(y), 


/f v r p 


and this establishes the property (F) for the function h(x, y). 


(8.4) The characteristic function of any set K (Z k f , , q measurable (Mr) 
has the property (F). 


We shall establish this, first for very special sets E and then, 
by successive stages, for general measurable sets. Suppose in the 
first place that 

1 0 E=AxB, where A and B are intervals half open 
to the left, situated respectively in R p and lt q , and such 
that the oscillations of U and of V vanish at the bound- 
aries of A and B respectively (cf. § 3, p. 60). The oscillation 
of the function T=UV therefore vanishes at the boundary of the 
interval E=AxB, and we find by Theorem 6.2 

(8.5) T*(E)=T(E) = U(A) • F(B)= U*(A) ■ V*(B). 


On the other hand, for every y e R q the function Ce(%, y) 
is in x the characteristic function either of the half open inter- 
val A, or of the empty set, according as yeB or y e R q — B. This 
function is therefore measurable (2u), and indeed measurable (33), 
for every ye.R q , and, by (8.5) 



Theorem of Fubini. 


79 


[§ 8 ] 

[c B (x,y)dT(x,y)=T*(E)=U*(A)-V*(B)= J [ / c B (x, y) dU(x)]dV(y). 

V, K 

2° E is an open set. We shall begin by showing that, in 
this case, E is the sum of a sequence of half open intervals { I „ } no 
two of which have common points, these intervals I„ being of the 
form A„xB„ where (a) A n and B n are intervals, half open to the 
left, situated in lt p and R q respectively, and (b) the oscillations of U 
and V vanish at the boundaries of A n and B„ respectively. 

To see this, let (U w ) be a regular sequence of nets in R p formed 
of intervals half open to the left and such that the oscillation of V 
vanishes at the boundary of each of these intervals; by Theorems 
4.2 and 2.1, such a sequence certainly exists. And let (Q3***) be 
a sequence of nets similarly constructed for the space U q and for 
the function V. We denote, for each k, by the system of all 
half open intervals in R P + q which are of the form AxB where AeU w 
and The systems of intervals Z (k \ thus defined, form a reg- 

ular sequence of nets of half open intervals in the space R p •. The 
set E being open, we can therefore express it (cf. § 2, p. 58) as the 
sum of a sequence of half open intervals (/„} taken from the nets 
and without points in common to any two. We see at once that 
each interval I„ of this sequence is of the form A„xB„ where A„ 
and B„ satisfy conditions (a) and (b). 

This being so, we have c B (x, y) = V c/ n (x, y), where on account 

n 

of the result established for the case 1°, each of the characteristic 
functions c Jfi (x,y) has the property (F). Therefore, to verify that 
the function e B (x,y) also has this property, we need only apply (8.3). 

3° E is a set (<fM). First suppose that, besides, the set E is 
bounded. E is then the limit of a descending sequence of bounded 
open sets {(?„}. The functions c 0| (x, y ) — c a n (x,y) constitute a non- 
decreasing sequence of non-negative functions which have, by 2° 
and (8.3), the property (F). Consequently, again on account ot (8.3), 
the limit function of this sequence h(x,y)=C 3 l (x,y) — c, B (x,y) itself 
has the property (F) and the same is therefore true of the function 
c.s(*, y)r=c 0l (x, y)—h(x, y). 

Now if E is an arbitrary set (©,>), we can express it as the limit 
of an ascending sequence ( H n } of bounded sets (©«). By what has 
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just been proved, the characteristic functions of the sets H„ have the 
property ( F ) and, consequently, the function CE(x,y)=limcH n (x,y) 
itself has- the property (F). " 


4° E is a set of measure (T) zero. There is then, by 
Theorem 6.5, a set containing E and of measure (T) zero. 

By the result established for sets (©a), the function c H(x,y) has 

the property (F), and therefore J [f (x, y) dU(x)]dV(y) = 

« <, k p 

=Jch(x, y) dT(x, y)—T*(H) = 0. Hence, for every yeR q , except at 

Rp+q 

most a set F of measure (F) zero, j c H (x,y)dV (x)=0, i. e. c H (x,y), 

K 

as function of x, vanishes almost everywhere (U) in R p . Hence, 
a fortiori, CE(x,y)^CH(x,y) as function of x, vanishes almost every- 
where (U), and is consequently measurable (£u), for all yeR q , except 
at most for those of the set Y of measure (F) zero. Finally, we 


clearly have J[fc E (x,y)dU (x)]dV(y)=0=T*(E)= J c E (x,y)dT(x,y). 


It R„ 

Q P 


The function c E (x,y) thus has the property (F). 

On account of Theorem 6.6 every set E measurable (Er) is 
expressible in the form E=H — Q, where H is a set (©a) and Q 
is a set of measure (T) zero contained in H. We thus have c £(x,y) = 
= c H(x,y) — c o(x,y), and by (8.3) the proposition (8.4) reduces to 
the special cases 3° and 4° already treated. 

The proposition (8.4) being thus established, let f(x, y) be 
any non-negative function measurable (2y) in the space R p + q . By 
Theorem 7.4, Chap. T, the function / is the limit of a non-decreasing 
sequence of simple functions, finite, non-negative, and measurable (fir). 
Nbw each of these simple functions is a linear combination, with 
positive coefficients, of a finite number of characteristic functions 
of sets measurable (2 t), and therefore has the property (F) on ac- 
count of (8.4). Thus the function / is the limit of a non-decreasing 
sequence of functions with the property (F), and so, by (8.3), / itself 
has the property (F). This completes the proof of Theorem 8.1. 


Let us make special mention of the particular case of the 
theorem in which f(x, y) is the characteristic function of a meas- 
urable set: 
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(8.6) Theorem. If V and V are two non-negative additive 
functions of an interval in the spaces R p and R q respectively, and if Q 
is a set measurable (2uv) in the space R p + q , then 

(ij) the set E[(a5, y) e$] is measurable (2 u) for every y e R q , 

X 

except at most a set of measure ( V) zero , 

(i 2 ) the set E [(x,y)eQ] is measurable (2v) for every xeR p , 
v 

except at most a set of measure (U) zero , and 


(ii) the measure {TJV) of Q is equal to 

fu*{E[(x,y)eQ ]> dV(y) =/ F* { E [(a?, y) e <?]> dV(x). 

* q * * p u 
Fubini’s theorem is frequently stated in the following form: 


(8.7) Theorem . Let U and V be two additive functions of bounded 
variation of an interval in the spaces Jt p and R q respectively. Then 
for every function f(x y y) integrable {TJV) on R p + qj the relation (ii) of 
theorem 8.1 holds good and the function f{x,y) is integrable (U) 
in x on R p for every yeR qj except at most a set of measure ( V ) zero f 
and integrable { V) in y on R q for every x e n P , except at most a set 
of measure (U) zero. 


We reduce this statement at once to that of Theorem 8.1 by 
expressing the function / as the sum of its non-negative and non- 
positive parts, and by applying to the functions of an interval U 
and V the Jordan decomposition (§4, p. 62). 


Further generalizations of Fubini’s theorem for the Lebesgue-Stieltjes 
integration (in particular including the theorems analogous to Theorem 15.1 of 
Chap. I) were studied by L. C. Young in his Fellowship Dissertation (Cam- 
bridge 1931, unpublished). An account of these researches will be given in the bool 
The theory of Stieltjes integrals and distribution-functions by L.C. Young (Oxford, 
Clarendon Press). 

It > folio ws in particular from Theorem 8.6 that for any set Q measurable 
in the sense of Lebesgue in the space R P +q, its measure (Lp+g) is given by the 
definite integrals Ij p {E[(x,y)eQ'\}dL q (y)= I L 9 (E [(x,y) eQ]}dljp{x). It is never- 

J x u 

R o R p 

theless to be remarked that the existence of these two integrals does not in 
general enable us to draw any conclusion as to measurability (£) of the set Q. 
W. Sierpinski [5] has in fact constructed in the plane a set non -measurable (^) 
having exactly one point in common with every parallel to the axes. This con- 
struction depends, needless to say, on the axiom of selection of Zermelo. 

For an interesting discussion of Fubini’s theorem for Lebesgue integration 
of functions of variable sign, vide G. Fichtenholz [1]. 
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We complete the theorems of this § by the following 

(8.8) Theorem. If Q is a set measurable (93) in the space R p + q , 
the set E[(», y) e@] is measurable (93) in the space It p for every y e R q , 

x 

and the set E [{x,y)eQ'\ is measurable (93) in R q for every xsR p . 
y 

Similarly, if a function f(x,y) is measurable (93) in the space R p + q , 
then in R p the function f(x, y) is measurable (93) in x for every y e ll q , 
and in R q the function f(x,y) is measurable (93) in y for every xeR p . 

Proof. It will be enough to prove the first half of the theorem, 
since the second half obviously follows from the first. Let us denote 
by 93 0 the class of all sets Q in R P + q such that the sets E[(a?, y)tQ], 

X 

for every yeR q , and the sets E[(#, y)eQ], for every xeR p , are 

y 

measurable (93) in the spaces R p and R q respectively. If a set Q(ZR P +q 
is closed, so are the sets E[(x, y)eQ] and E[(#, y)eQ]. The class 93 0 

x y 

thus contains all closed sets of the space R p + q , and on the other 
hand we see at once that 93 0 is additive. It follows that 33 0 includes 
all Borel sets in the space R p + q (cf. the definition, Chap. II, p. 41), 
and this completes the proof. 

* § 9. Fubinl’s theorem in abstract spaces. We shall 
return in this § to the abstract considerations of Chap. I and show 
that for abstract spaces, theorems similar to those of the preceding § 
hold good. 

Given any two sets X and Y, we shall denote by X x Y the 
set of all pairs of elements (x, y) for which x (■ X and yeY. The 
set Ixl is often called combinatory product or Cartesian product 
(cf. C. Kuratowski [I, p. 7]) of the sets X and Y. The following 
identities are obvious 

(9.1 ) (I 1 xr 1 ).(Y 8 xr ! )=(I 1 .I J )x(Y 1 .Y ! ), 

(9.2) (X 2 X Y 2 )-(Z lX Y t )=[(Xr- X t )X Y 2 ]+[(Y 2 .Y,)x ( Y 2 -Y,)], 

the sets X v X 2 , ¥ v Y 2 being quite arbitrary. 

If SE and ?) are additive classes of sets in the spaces X and Y 
respectively, SE^) will denote the smallest additive class of sets 
in the space X x 1', containing all product-sets of the form X x Y, 
where Xe9E and Ye?). 
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For auxiliary purposes, we shall make use in this § of 
the following definition: a class 91 of sets wiH be termed normal, 
if (i) the sum of every sequence of sets (91) no two of which have 
common points is itself a set (91) and (ii) the limit of every 
descending sequence of sets (91) is a set (91). 

We shall begin by proving the following analogue of Theorem 8.8: 

(9.3) Theorem. Let 9f and 9) be two additive classes of sets in 
the spaces X and Y respectively. Then, if Q is a set measurable (9E9)) 
in the space Xx Y, the set E[(x, y)<-Q] is measurable (9 E) for every 

X 

yeVj and the set E [(x,y)eQ] is measurable (?)) for every a; eX. 

if 

In the same way a function f(x , y) which is measurable (SE?)) 
in the space X xT, is measurable (9E) in x for every y e Y and meas- 
urable ($)) in y for every xeX. 

Proof. It is enough to prove only the first part concerning 
sets. To do this, we denote by SP1 the class of all sets Q in X x Y 
such that the set E[(#, y)eQ] is measurable (9E) for every y eY, 

X 

and that the set HJ[(x,^)eQ] is measurable (?)) for every xeX We see 

y 

at once that the class 991 is additive in the space X x 1' and that, 
besides, it includes all sets XxY for which Xe9f and Ye?). Hence 
and this completes the proof. 

Before proceeding further we shall establish the following 
lemma: 

(9.4) Lemma. If 9E and 9) are two additive classes of sets in 
the spaces X and 1' respectively, the class XT) coincides with the smal- 
lest normal class that includes the sets XxY for which X e9C and Y e 9). 

Proof. -For brevity let us term elementary any set X x Y 
for which X <-■ 9c and Ye?), and let 91 0 denote the smallest normal 
class which includes all elementary sets (i. e. the common part 
of all the normal classes that include these sets). Clearly 91 0 C 9^2) 
since 9f9) is also a normal class. In order to establish the opposite 
inclusion, it is enough to prove that the class 91 0 is additive, and this 
will be an immediate consequence of the following two properties of 
the class 91 0 -‘ 
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(9.5) The common part of any sequence of sets (9t 0 ) is itself a set (9l 0 ). 

(9.6) The complement (with respect to the space XxY) of any set (9t 0 ) 
is again a set (9t 0 ). 

To prove (9.5), it is enough, since the class 9t 0 is normal, to 
show that the common part of two sets (9l 0 ) is a set (9t 0 ). 

For this purpose, let 9^ be the class of all the sets (9l 0 ) whose 
common parts with every elementary set belong to 9t 0 . 
From the identity (9.1), it follows that the common part of two 
elementary sets is an elementary set, and hence that includes 
all the elementary sets. On the other hand, we verify at once that 
is a normal class. This gives an( l since by definition 

C 9t 0 , we obtain 9l 1 =91 0 . 

Let now 91 2 be the class of all the sets (9l 0 ) whose common 
parts with every set (91 0 ) belong to 9l 0 - Since the class 

9t 2 includes all elementary sets. Furthermore, 91 2 is clearly a normal 
class. We therefore have 91 2 =9? 0 , and this proves (9.5). 

To establish (9.6), let 9I 3 be the class of all the sets (9l 0 ) whose 
complements are also sets (9t 0 ). On account of the identity (9.2) 
the complement of any elementary set is the sum of two elementary 
sets without common points, and so, a set (9l 0 ). Therefore the class 9l 3 
includes all elementary sets and, to conclude that 9l 3 =9l 0 > ^ suffices 
to show that the class 9l 3 is normal. 

Let therefore be any sequence of sets (9l 3 ) without com- 
mon points to any two of them, and let X be the sum of the se- 
quence. The set X clearly belongs to the class 9i 0 . On the other 
hand, the sets CX n are, by hypothesis, sets (9l 0 ); 80 that, by (9.5), 
the same is true of their product CX—JJCX n . Thus we have at 

n 

the same time, X e 91 0 and CX e 9l 0 , and therefore X e 9l 3 . 

Again, let ! Y n ) n ^ 2 y ... be a descending sequence of sets (9? 3 ), 
and Y its limit. The set Y clearly belongs to the class 9t 0 . On the 
other hand, consider the identity 

oo oo 

CY = ZCY n = CYi + Eln-CY n+i , 

n 1 n - 1 

and observe that no two of the sets CYi and Y„ Cr„+i for n= 1,2, ... 
have common points. Since these sets belong, by (9.5), to the class 91 0 , 
so does the set C Y. Thus we have both Y e 91 0 and C Ye 91 0 , whence 
Ye9l 3 . The class 91 3 is therefore normal, and this establishes (9.6) 
and completes the proof of Lemma 9.4. 
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We can restate Lemma 9.4 in the following more general form: 

(9.7) Given in an abstract space T a class Q of sets additive in the weak 
sense , then the smallest class that is additive (in the complete sense) and contains Q. 
coincides with the smallest normal class containing Q. 

The proof is the same as for Lemma 9.4. 

If and $) are additive classes in the spaces X and l r respectively, the 
finite sums of the sets XxY for which X e# and Ye $), constitute, according to 
formulae (9.1) and (9.2), a class that is additive in the weak sense ( vide Chap. I, 
p. 7) in the space XxY Another example of a class of sets additive in the 
weak sense consists of the class of all the sets of an arbitrary metrical space M, 
that are both sets ((&,}) and (JJo). The smallest class that is additive (in the com- 
plete sense) and contains these sets is clearly the class of Borel sets in 31. 

The assertion of (9.7) enables us to prove easily the following theorem 
due to H. Hahn (2, p. 437J and in some respect analogous to Theorem 6.6: 

Let Q be a class of sets . additive in the weak sense in a space 1\ and let T be 
the smallest class of sets that is additive in the complete sense and contains Q. Sup- 
pose further that r is a measure ($) such that the space T either has finite measure (r), 
or, more generally , is expressible as the sum of a sequence of sets of finite measure (r). 
Then (i) for every set E measurable (%) and for every there exists a set Ee 

and a set Ge Q a , such that FC^ECjG and that i (E — F) e and i{G —E)^e; 
(ii) for every set FJ measurable {%) there exist a set (Q^) contained in E t and a set 
(Qfirt) containing E % which differ from E at most by sets of measure (z) zero. 


(9.8) Theorem . Let 9E and ^ be additive classes of sets in the 
spaces X and Y respectively , and let ft and v be measures defined 
respectively for these classes . Suppose that p( A)<oo and v{ K)<oo, 
or, more generally , that 


(9.9) 

where X„e 9E, 


a'=2X, y=lY„ 

n n 

//( A'„)<oo and »’(r„)<oo for n— 1,2.... 


Then, for every set QC_Xx 1' measurable (9E?)) r (i) 

X 

as function of y , is measurable (?)) in the space Y a?id r{E[(#,jy)e<?]}, 

y 

as function of x , is measurable (9E) in the space X] furthermore 


(ii) m[nl®[(x,y)tQ]}dr(y) = m fv{E[(x,y)vQ])d(iU)- 

V x X l ' 

Proof. We may clearly suppose that no two sets A’,„ and 
also no two sets Y,„ have common points. The same will then he 
true of the sets A„x Y„, in the space A'x 1”. 

Let us denote by 91 the class of all the sets P measurable (9E9)) 
in the space A'x.F, such that conditions (i) ami (ii) of the theorem 
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hold good for ©very set Q=P-(X„x Y m ) where re and m are arbitrary 
positive integers. Since Q=£Q(X„x Y m ) for every set QQXx V, 

n,m 

and since no two of the sets X„x Y m have common points, it follows 
easily from Lebesgue’s Theorem 1213, Chap. I, that every set Q 
belonging to the class 91 fulfills conditions (i) and (>i). We have to 
prove that this class includes all sets measurable (9E?))- 

To do this, we observe that it follows at once from the identity 
(9.1) that every set Xx Y for which X eX and Ye 9), belongs to 91. 
On the other hand, since by hypothesis /< (Y„)<oo and c( Y„)< oo 
for every re, we easily deduce from Lebesgue’s Theorems 12.3 and 12.1 1 
Chap. I, that the sum of any sequence of sets (91) no two of which 
have common points, and the limit of any descending sequence 
of sets (91), are themselves sets (91). The class 91 is therefore normal, 
and by Lemma 9.4, contains all sets (9E?)). This proves the theorem. 

If we suppose the hypotheses of Theorem 9.8 satisfied, a meas- 
ure can be defined for the class 9t?) so as to correspond naturally 
to the measures /< and v that are given for the classes 9E and ?)• 
We do this by calling measure (fir) of a set Q measurable (9E?)) the 
common value of the integrals (ii) of Theorem 9.8. It is immediate 
that we then have pv(XxY)=p( X)-v(Y) for every pair of sets 
A r eS and Ye?). 

This definition enables us to state Theorem 9.8 in a manner 
analogous to Theorem 8.6. But the analogy would be incomplete 
if we neglected to extend at the same time the class 9E?). Thus, 
for instance if 9E and ?) denote respectively the classes of sets meas- 
urable in the Lebesgue sense in Euclidean spaces li,, and I{,„ the 
class SE?) does not coincide with that of the sets measurable (2) 
in Hp + q , although it is evidently included in the latter. The extension 
of the class Y?), that we require in the general case, will lx* defined 
as follows. 

Given an additive class of sets 2 and a measure t associated 
with this class, we shall call the class 2 complete with respect to the 
measure i if it includes all subsets of sets (2) of measure (i) zero. 
Thus for instance, if /’ denotes any measure of Oarath^odory, the 
class Si/- is complete with respect to F (cf. (’hap. II, p. 44), and in 
particular, the class 2 in a Euclidean space is complete with respect 
to Lebesgue measure; whereas the class of sets measurable (93) is 
not complete with respect to that measure. 
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l»3 


Every additive class of sets Z may be completed with respect 
to any measure t defined for the class, i. e. there is always an ad- 
ditive class such that the function of a set t can be continued 

as a measure on all sets (S) and such that S is complete with respect 
to the measure / thus continued. Among the classes 6 of this kind, 
there is a smallest one that we shall denote by 2'. As is seen di- 
rectly, this class consists of all sets of the form T — where 
TeZ and N u jV 2 are arbitrary subsets of sets (2) of measure (t) 
zero. The extension of the measure % to all sets of this form is evident. 

We can now state the following theorem which corresponds 
to Fubini’s Theorem 8.1: 

(9.10) Theorem. Under the hypotheses of Theorem 9.8, if f(x, y) 
is a non-negative function measurable ( in the space Xx 1, 

(h) f( x j V ) as function of x is measurable ( S X") in -V for every 
yeV, except at most a set of measure (v) zero-, 

(i 2 ) f(x, y) as function of y is measurable (T)‘) in V for every 
xeX, except at most a set of measure (,«) zero-, 

(ii) / f(x,y) dnv(x,y)= j [ f f(x,y)dp(x)\dv(y) = / [ j f(x,y) dv(y) ]dp(x). 
xx r i - x x i' 

Proof. In the special case in which / is the characteristic 
function of a set measurable (X'l)), the theorem is an immediate 
consequence of Theorem 9.8. The same _is_ true when f is the cha- 
racteristic function of a set measurable (X^)" 1 ) of measure (fiv) zero, 
and in consequence Theorem 9.8 remains true when / is the cha- 
racteristic function of any set (XT)" 1 ’). 

This being so, we pass as usual to the case in which f is a finite 
function, simple and measurable (X^)"’’) and finally, with the help 
of Theorems 7.4 and 12.6, Chap. I, to the general case in which / 
is any non-negative function measurable (X S D‘" ). 

Condition (9.9) is essential to the validity of Theorems 9.8 and 9.10. To 
see this, let us consider some examples for which the condition is not fulfilled. 
Put .V Y~H„ and let it = ; 1) be the class of all sets in K, that are measurable 
in the sense of Lebesgue. We choose for /• the ordinary Lebesgue measure, and 
we define the measure »> by making t’(Y) equal to the number of elements of I" (so 
that v(Y)—oo if V is an infinite set). Finally, let Q be the set of all the points 
(x, x) in /i t =Xx V such that 0 -s; x 1 . The integrals occuring in condition 
(ii) of Theorom 9.8 are then respectively 0 and 1 so that condition (ii) does not 
hold. (We could also, by a suitable modification of the set Q, choose 1 = #f 2 
and take as measure v the length .1,; cf. Chap. II, § 8.) 
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Another example showing the importance of condition (9.9) is due to 
A. Lindenbaum. Put X—Y—R x , let £=$) be, the class of all Borel sets in R l9 and 
let fi(X)~v(X) denote for every set X the number of its elements. By a theorem 
of the theory of analytic sets (cf., for instance, C. Kuratowski [I, p. 261]) there 
exists in the plane R 2 =Xx V a Borel set Q such that the set of xe R x for which 
E [(ar, y)cQ] reduces to a single point, is not measurable in the sense of Borel. 
y 

In other words, the set of xeX for which i'{E[(a?, y)e<J>]) = 1 is not measurable (9E). 

y 

Thus condition (i) of Theorem 9.8 does not hold. 

For the results of this §, vide H. Hahn [2]; cf. also S. Ulam [2], Z. Lora- 
nicki and S. Ulam [1], and W. Feller [1]. For a discussion of Fubini’s theorem 
applied to functions whose values belong to an abstract vector space, vide also 
S. Bochner [2]. Finally we observe that certain theorems, analogous to those 
established in this § for measurable sets and sets of measure zero, can be stated 
for the property of Baire and the Baire categories. Cf. on this point C. Ku- 
ratowski and S. Ulam [1], 

§ 10. Geometrical definition of the Lebesgue-Stieltjes 
integral. The geometrical definition of an integral is inspired by 
the older and more natural idea of regarding the integral as the 
measure of an "area”, or of a “volume”, attached to the function 
in a certain way that is well known. 

Let us begin by fixing our notation. Given a function f{x) 
defined on a set Q(ZR m , we call graph of /(.r) on Q, and we denote 
by B(f;Q), the set of all points (x, y) of R m+ , for which xtQ and 
y—f( ,r)^oo. If f(x) is non-negative on Q, the set of all the points 
(x,y) of lt m H such that xeQ ami 0 ^.y^j(x) is termed, according 
to C. Oarathdodory, ordinate-set of f on Q and will be denoted by 
A(/; Q). 

As in §§ 3 — 7 we shall suppose the space R m fixed and a non- 
negative additive function V of an interval given in R m . And in 
accordance with § 2, p. 59 and § 5, p. 65, L x denotes the Lebesgue 
measure in R v 

(10.1) Lemma. 7/ Q C_R m is a set measurable (Cf/), the set 

E[.re(); a^y^b] in R m +i is, for every pair of real numbers a and 
(x-y) 

b^a, measurable (S(/l,), and its measure (U L.) is (b — a)-(J*(Q). 

Proof. Let us write for short, <^„,6=E[d!eQ; a*^.y<ib\ and 

(x,y) 

T—Uh v We shall begin by showing that if Q has measure (U) 
zero, the set Q a ,h is of measure ( T) zero, and so is certainly meas- 
urable ( j?r). 
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To see this, observe that there is then, for any f>0, a sequence 
of intervals {/„} in lt m such that QC£lh and £U(I, ,)<:e. Writing 

n n 

(/ n =^nX[a— f, &+£], we obtain a sequence of intervals {J„} in /i„ I+1 , 
such that Qa.bCUJh and (J n )=ZlJ (I n )-(b—< a-\-2t)^(b — a-\-2t)-f. 

n n n 

Thus T*(Q a , b )= 0. 

Let now Q be any set measurable (2u). By Theorem 6.6, Q is 
the sum of a sequence of closed sets and a set of measure (U) 
zero. Therefore, by the above, the set Q n , b is also the sum of a se- 
quence of closed sets and of a set of measure (T) zero, and is thus 
measurable (2r). Finally, for every real number y, we have 
E[(^, y)eQ, ,,*]--<? if a^.y^b, and E[(a:,y )<?()«, *3=0 if y is outside the 

X x 

b 

interval [a, ft]. Hence, by Theorem 8.6, we have T*(Q„ ih )= j U*(Q)dy— 

a 

= (b — a)-U*(Q), which completes the proof. 

(10.2) Theorem. If f(x) is a function measurable (fiy) on a set 
QCRm, its graph on Q is of measure (UL X ) zero. 

Proof. Since any set measurable (2u) can be expressed as 
the sum of a sequence of bounded measurable sets, we can restrict 
ourselves to the case in which Q is bounded. 

Let us fix an £>0 and write ^ n =E[xe^; n e^f(x)<(n + l)r] 

X 

for every integer n. By Lemma 10.1 the measure ( f/L,) of the graph 
of f(x) on Q„ does not exceed t-U*(Q n )', therefore, on the whole setQ, 
it does not exceed and so vanishes. 

We can now prove the following theorem which includes the 
geometrical definition of the Lebesgue integral: 

(10.3) Theorem. In order that a function f(x) defined and non- 
negative on a set QC_R m measurable (2u) be measurable (2u) on Q, it 
is necessary and siifficient that its ordinate-set A (/; Q) on Q be meas- 
urable (Cut,,). When this condition is fulfilled , the definite integral ( U) 
of f on Q is equal to the measure ( t/L,) of the set A (/; Q). 

Proof. Write, for short, T—U L x and suppose first that f(x ) 
is a simple function, finite, non-negative, and measurable (2 u) on Q, 
i.e. that /={d 1 ,Q 1 ; v^Qi] ...; v„,Q n } where Q t are sets measurable (2 u) 
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no two of which have common points. By Lemma 10.1, all the 
sets A are measurable (2 r ) and T*[A(f;Q i )]==v r U*(Q i ) 
for i=l, 2, ..., n. Hence, the set A(f-,Q)=^A(f-,Qi) is itself meas- 

i 

urable (2 r), and its measure (T) is equal to 2J T* [A(/; Qi)} — 
— 2 j Vi-U*(Qt)= : /'/(*) dU(x). 

1 Q 

Let now / be any non-negative function measurable (2 t) on Q. 
There is a non-decreasing sequence [h„(x)} of simple functions, 
finite, non-negative, and measurable (2 u) on Q such that / (.r ) =lim h„(x). 

it 

We then have 

(10.4) A(/; Q) = lim A(h n ; Q). 

n 

Now, by the above, all the sets A (h n -, Q) are measurable (2 r) and 

T*[A(h„-,Q)]==J h n dU for n— 1,2,.... On the other hand, by 

Q 

Theorem 10.2, the set B (/; #) has measure (T) zero. It therefore 
follows at once from (10.4) that the set A (/; Q) is itself measur- 
able (2 T ) and that 

T*[A(/; Q)] = lim /' h„dU=ffdU . 

n Q Q 

It remains to prove that, if the set A (fjQ) is measurable (2r), 
the function / is measurable (2 u). To do this, write for short 
A — A(f-,Q), and observe that, for every non-negative number y, 
the set E[a;e<2; f(x)^y] coincides with the set E[(a;, y)eA]. 

X x 

Thus, by Theorem 8.6, it A is measurable (2r), the set E[#eQ; f(x)^y] 

X 

is measurable (2 u) for all numbers y except at most those of a set 
of measure (Li) zero. But this suffices for the measurability of / 
on Q (cf. Chap. I, (7.2)) and so completes the proof. 


* $ 11. Translations ’of sets. As an application of Theorem 
8.6, we shall prove in this § a theorem on parallel translations 
of sets. As a matter of course, in what follows, translations could 
be replaced by rotations, or by certain other transformations consti- 
tuting continuous groups and preserving Lebesgue measure. 
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Given two points x=(x h ..., x m ) and y = (y i, yi, ..., y m ) in the 
space /£„„ we shall denote by x-\-y the point (Xi+y\,xz+yz, —,x fn -{-y, n ) 
and by \x\ the number (x] -f x] + ... + x ? m) ' We shall write 
x > 0 when |a?|~*0. If # is a set in the space and a any 
point of this space, Q {n) will denote the set of all points x+a where 
x t Q. The set Q in) is termed translation of Q by the vector a. If ft is 
-an additive function of a set in lt m and aeU m , we shall write 
ft {n) (X) = ft(X {a) ) for every set X bounded and measurable (33). 

(11.1) Theorem . If Q is a bounded set (33) of measure (L) zero in the 
space Ii m and ft* is an additive function of a set (*©) in lt m , the func- 
tion ft vanishes for almost all translations of Q, i. e. ft(Q (a) ) — ft {a \Q)—0 
for almost all points a of R m . 

Proof. We may clearly assume ft to be a non-negative function, 
and Q to be a bounded set (©<,). Hence, by Theorem 6.10, there is 
a non-negative additive function U of an interval, such that 
ft(X)=U*(X) for every set X bounded and measurable (33). 

Denote by M , for any set MC_lt m , the set of all points (x, y) 
of the space which are such that xclt m , yelf m and x+yfM. 

The set M is clearly open whenever the set M is open. It follows 
at once that if M is a set (© ( s), so is the set M. Finally, observe that 
for every point zeR m we have E[(#, z) eM]=E[(z, y)eM] = M { ?) . 

-r !l 

Since the given set Q is, by hypothesis, a set (©<0, so is the 
set#, and by Theorem 8.6, f U*(Q { ~ z) )dL m (z) = j*L m (Q { ~ -*))dU(z)= 0, 

Km Km 

because ail translations Q { ~ z) of the set Q are of measure (L„,) zero. 
Hence ft(Q { z) )mU*{Q { z) ) = 0 for every zelt nn except at most a set 
of measure (L„,) zero. Replacing — z by a, we obtain the required 
statement. 

(11.2) Theorem. Given an additive function of a set ft , each of 
the following three conditions is both necessary and sufficient for the 
function ft to be absolutely continuous: 

1° lim ft(Q {a) ) = lim ft {a \Q)—ft(Q) for every bounded set Q meas - 

rt-X> <7->0 

urable (33) and of measure (L) zero ; 

2° lim ft(Q {a) ) = lim ft {n) (Q) = ft(Q) for every bounded set Q 

o-M) 

measurable (©); 

3° limW[<P ( " ) — </>;/]= 0 for every interval I. 
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Proof. It is evidently sufficient to establish the necessity 
of condition 3° and the sufficiency of condition 1 °. 

Suppose first that is an absolutely continuous additive 
function 'of a set. In virtue of Theorem 14.11, Chap. I, <P is thus 
the indefinite integral of a function / measurable (93). Let 
1 = [«i> &i> •••; a m , b,„] be an interval in the space considered and let J 
be an interval containing I in its interior, for instance the interval 
[«i — 1 , fti + 1 ; ...; «„ — 1 , 6 m +l]. 

Let f be any positive number. Since the function f(x) is in- 
tegrable over J, there exists a number 17 X) such that f\f(x) dx<e/3 
x 

for every set X(2J measurable (93) and of measure (L) less than rj. 
Therefore 

I \f(x)\dx<.e/3 and f \f(x+u)\ dx— f | f(x) dx<Cd3 
(H-3) x x x(u) 

if Xe 93, XCI, \*\<V and \u\ < 1 . 

On the other hand, by Lusin’s Theorem 7 . 1 , there exists a closed 
set FCI° such that the function / is continuous on F and such 
that 1 1 — F\<.t]/2. Let a < 1 be a positive number such that 
F {u) C_I whenever |tt|<o, and such that 

(11.4) \f(x+u) — /(o5)|<^jj| whenever xeF, x+ueF, and |«|<<r. 

Let now a be any point of R m such that |a|<o. By ( 11 . 4 ) 

(11.5) f \f{x-\-a) — f(x)\ «te<|J|.(f/3 I\)—e/3. 

On the other hand, | I—F F { - a) \^\I—F\+\I—F ( ~ a) \^2-\I-^F\<r], 
and therefore, by ( 11 . 3 ), 

J \f(x+a) — f(x)\dx < 2f/3. 

If we add this inequality to (11.5) we obtain f \f(xj-a) -f(x) j dx<e, 

1 

i. e. the variation W[<P (fl) — 0-, I] = f \f(x+a)—f(x)\dx tends to 0 

with |tf|. The function <J> therefore fulfills condition 3 °. 

It remains to prove the sufficiency of condition 1 °. Now, if 
the function fulfills this condition, O vanishes by Theorem 11.1 for 
every bounded set measurable (93) of measure (L) zero, and so is 
absolutely continuous. 
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It was long known that every absolutely continuous function 4> fulfills 
conditions 1°, 2° and 3° of Theorem 11.2. The converse, however, (i. e. the suf- 
ficiency of these conditions, in order that the function 0 of a set be absolutely 
continuous) was established more recently. The sufficiency of condition 3° was 
first proved by A. Plessner [1] (with the help of trigonometric series and for 
functions of a real variable). As regards the other conditions (1° and 2°), and 
as regards Theorem 11.1, vide H. M ilicer-Gruz ewska [1], and N. Wiener 
and R. C. Young [1]. In the text we have followed the method used by the 
latter authors. 


§ 12. Absolutely continuous functions of an Interval. 

An additive function F of an interval will be termed absolutely con- 
tinuous on a figure R 0 , if to each £>0 there corresponds a number 
f?>0 such that for every figure f?C^o the inequality implies 

|jP( 1?)! < f. In conformity with § 3, p. 59, we shall understand by 
absolute continuity in an open set 0, absolute continuity on every 
figure RCO, and by absolute continuity, absolute continuity in the 
whole space. 

Every additive function of an interval, absolutely continuous on 
a figure R 0 , is of bounded variation on R 0 . For, if F is a function 
that is absolutely continuous on R 0 , there exists a number *?>0 
such that, for every figure R(ZR 0 , the inequality |/?| < implies 
|F(f£)|<l. Therefore, if we subdivide R 0 into a finite number of 
intervals 1 1 , h, ..., /„ of measure less than t?, we obtain W(F; R 0 )^. 

&=1 

An additive function of an interval F , of bounded variation 
on a figure R 0} will be termed singular on R 0 , if for each £>0 there 
exists a figure R(ZRq such that |i?|<£ and W(.F; R 0 QR) <r. 

The reader will observe the analogy between the above definitions and 
the criteria given in Theorems 13.2 and 13.3, Chap. I, in order that an additive 
function of a set should be absolutely continuous or singular. This analogy could 
be pushed further by introducing the notions of absolutely continuous function, 
and of singular function, with respect to a non-negative additive func- 
tion of an interval. But this “relativization”, although useful in certain 
cases, would not play an essential part in the remainder of this book. 

The following theorem is, almost word for word, a duplicate 
of Theorem 13.1 of Chapter I. 
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(12.1) Theorem . 1° In order that an additive function of an 

interval be absolutely continuous [ singular ] on a figure R 0 , it is neces- 
sary and sufficient that its two variations , the upper and the lower, 
should both be so. 2° Every linear combination , with constant coefficients , 
of two additive functions of an interval which are absolutely continuous 
[singular] on a figure R 0 is itself absolutely continuous [ singular ] 
on R 0 . 3° The limit of a bounded monotone sequence of additive functions 
of an interval that are absolutely continuous [ singular ] on a figure R 0 
is also absolutely continuous [ singular ] on R 0 . 4° If an additive function 
of an interval is absolutely continuous [ singular ] on a figure R 01 the 
function is so on every figure R(2R 0 . 5° If an additive function of 
an interval is absolutely continuous [ singular ] on each of the figures 
R x and R 2 , the function is so on the figure R x +R 2 . 6° An additive function 
of an interval cannot be both absolutely continuous and singular on 
a figure R 0 , without vanishing identically on R 0 . 

Part 3°, at most, perhaps requires a proof. (It differs slightly 
from the corresponding part of Theorem 13.1, Chap. I.) Let therefore 
F be the limit of a bounded monotone sequence {F n \ of additive 
functions of an interval on a figure R 0 . Let e be any positive number. 
Since the functions F — F n are monotone on K 0 , there exists a pos- 
itive integer n 0 such that 

(12.2) \F(R)—F no (R)\^\F(R 0 )—F„ u (li 0 )\<el2 for every figure RQR^. 

This being so, let us suppose that the functions F n are ab- 
solutely continuous on R 0 . There is then an » 7>0 such that, for 
every figure R(^R 0 , implies the inequality \F„ 0 (R)\< e/2 

and therefore, by (12.2), the inequality |i' , (12)| <r. The function F 
is thus absolutely continuous on R 0 . 

Suppose next that the functions F„ are singular on R 0 . There 
is then a figure such that | K l |<e and WfA’n,,; A^OA^] < e/2. 

Hence, by (12.2), W[F-, ff 0 QJJ 1 ]<f, which shows that the function 
F is singular. This completes the proof. 

We shall now establish two simple theorems that show ex- 
plicitly the connection between the absolutely continuous or sing- 
ular functions of an interval and those of a set. To avoid misunder- 
standing, we draw the reader’s attention to the abbreviations 
adopted in § 5, p. 66, in the terminology of functions of a set. 
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(12.3) Theorem. In order that a non-negative additive function F 
of an interval be absolutely continuous, it is necessary and sufficient 
that the corresponding function of a set F* should be so. 

Proof. Suppose that the function F is absolutely continuous. 
In order to prove that the function F* is so too, it is enough to 
show that F* vanishes on every bounded set of measure (L) zero. 
Let therefore E be such a set, and let J be an interval that contains 
E in its interior. For any e> 0, let rj be a positive number such that 

(12.4) im plies \F(R)\ <C £ for every figure RCJ ■ 

Since |15|=0, there exists a sequence of intervals {!„} in J such that 

(12.5) EQ I In and Z\I n \<V- 

n n 

Denote by P* the sum of the k first intervals of this sequence. 
By Theorem 4.6 (or 6.1) of Chap. II, and Theorem 6.2, the relations 
(12.4) and (12.5) give F*(E) ^ lim F*(R%) ^ limF(J2*) ^f, from 
which it follows that F*(E) = 0. 

Conversely, if F* is an absolutely continuous function of a set, 
the absolute continuity of F follows at once from the inequality 
F(R) <^F*(R) which holds by Theorem 6.2 for every figure R. 

(12.6) Theorem. In order that a non-negative additive function 
of an interval F be singular , it is necessary and sufficient that the 
corresponding function of a set F* should be so. 

Proof. Suppose that the function of an interval F is singular, 
and let J be any interval. Given any number e ,>0, there is then 
a figure R C<I such that |P|<c and F(JQR)<^e. Consequently, 
by Theorem 6.2, we have F*(J° — R)CF (JQR)<s, which shows 
on account of Theorem 13.3, Chap. I, that the non-negative function 
of a set F* is singular in the interior of every interval J , and there- 
fore in the whole space. 

Suppose, conversely, that the function of a set F* is singular, 
and let e be any positive number. Given any interval I there is then 
a set ECI° such that |P| = 0 and F*(I° — E)= 0. Consequently, 
there is a sequence of intervals {I,,} in I such that 

(12.7) I°— EC 2 In and (12.8) ZF(I n )<e. 

n n 

Denote by P* the sum of the k first intervals of this sequence. Since 
|P|=0, we obtain from (12.7) that |P*J > |Z| — e for a sufficiently 
large k 0 , and writing P=/0i£* o , this gives |P|<r. Again, by (12.8), 
P(I©P)< f, which proves that the function F is singular. 
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§ 13. Functions of a real variable. The most important 
of the notions and theorems of this chapter were originally given 
a rather different form: they were made to refer, not to additive 
functions of an interval, but to functions of a real variable. It is, 
however, easy to establish between functions of a real variable and 
additive functions of a linear interval, a correspondence rendering 
it immaterial which of these two kinds of functions is considered. 

To do this, let f(x) be an arbitrary finite function of a real 
variable on an interval Z 0 . Let us term increment of f(x) over any 
interval I = [a y b] contained in Z 0 , the difference f(b) — f{a). Thus 
defined the increment is an additive function of a linear interval 
Z(~Z o, and corresponds in a unique manner to the function f(x). 
Conversely, if we are given any additive function F(I) of a linear 
interval Z, this in itself defines, except for an additive constant, 
a finite function of a real variable f(x) whose increments on the 
intervals Z coincide with the corresponding values of the function F(I). 

We shall understand by upper , lower and absolute , variations 
of a function of a real variable f(x) on an interval Z, the upper, 
lower, and absolute, variations of the increment of f(x) over Z. 
To denote these numbers, we shall use symbols similar, to those 
adopted for additive functions of an interval, i. e.: W(/; Z), W(/; Z), 
and W (/; Z). 

A finite function will be termed of bounded variation on an 
interval Z 0 , if its increment is a function of an interval of bounded 
variation on Z 0 . Similarly the function is absolutely continuous y or 
singular , if its increment is absolutely continuous, or singular. As we 
see immediately, in order that a function f(x) be of bounded vari- 
ation on an interval Z 0 , it is necessary and sufficient that there 
exists a finite number M such chat £\f{bi) — /(a/)|<JZ for every 

sequence of non-overlapping intervals {[a/, bf]) contained in Z 0 . 

Similarly, in order that f(x) be absolutely continuous, it is necessary 

and sufficient that to each £>0 there corresponds an t?>0 such 

that 2j\ f(bi ) — ■/(«/)( <* for every sequence of non-overlapping inter- 
/ 

vals {!>/,&/]} contained in Z 0 and for which b t — 
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If /( x) and g{x) are two bounded functions on an interval I o* 
and M denotes the upper bound of the absolute values of f(x) and 
g(x) on Z 0 , we have 

\f(b)g(b) -f(a)g(a)\ < M[\f(b) —f(a)\ + |< 7 (&)_ ff («)|] 

for every interval [a, 6] C 7 0 - It follows at once that 

(13.1) The product of two functions of bounded variation [ absolutely 
continuous ] on an interval is itself of bounded variation [ absolutely 
continuous ] on this interval. 

Finally we see that if a function of an interval F corresponds 
to a finite function of a point / (i. e. is the increment of /), we have 
o /(/; a)=o,(F; a) for any interval I and any point a el (cf. Chap. II, 
§ 3, p. 42, and the present Chapter, § 3, p. 60). Thus, in particular, 
in order that the function / be continuous at a point a according 
to the definition of § 3, Chap. II, it is necessary and sufficient that 
the function of an interval F that corresponds to / should be so 
according to the definition of § 3 of the present Chapter. 

If at a point a a function of a real variable / has a unique, 
limit on the right, this limit will be denoted by /(<*+); similarly, 
f(a — ) will stand for a unique limit on the left. If the function / 
is defined in a neighbourhood of a point a and both limits /(a+), 
/(a — ) exist, then the oscillation o (/; a) ( vide Chap. II, p. 42) is 
equal to the largest of the three numbers |/(a-f) — /(a — -)|, 
|/(«+)— /(<*)|, and |/(«— )— f(a)\. 

If both limits /(«+ ) and f(a — ) exist and are finite, and 
f(a)—^[f(a-\-)+f{a — )] the function f(x) is termed regular at the 
point a. It is regular if it is regular at every point. 

Let / be any function of a real variable, of bounded variation, 
and {«„} a sequence of points. Let us put s(a) — 0 and 

/(«+)— / («) +2’ <a,x) [/ (««+)— /(««—)]+/(*)— /(*— ) for x>a 

n 

f(ar-)—f(a)+^ a ' x \f(a n —)—f(a n +))+f(x)—f(x+) for x<a, 

n 

where the summation £ (a,x) is extended to all indices n such that 

n 

d<ia n <Cx, when a?>a, and a>a n >rr, when a?<a. The function 8 
thus defined is termed the saltus-function of f corresponding to the 
sequence {a„} of points . It is continuous everywhere except, perhaps, 
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at the points a„, and by subtracting it from f we obtain a function 
of bounded variation, continuous at all points of continuity of / and, 
besides, at all the points a„. If {a,,} is the sequence of all points of 
discontinuity of /, the corresponding function s is called simply 
the saltU8 -function of f. By varying the fixed point a we get the 
various saltus-functions of / which can obviously differ only by 
constants. A function of bounded variation which is its own saltus- 
function, is called a saltus-f unction. 

The functions of a real variable whose increments over each 
interval I coincide respectively with the variations W (/; I), W(/; I) 
and W (/; I) of a function /, are also termed (upper, lower, and ab- 
solute) variations of f. By applying the Jordan decomposition (§ 4, 
p. 62), we can express any function of a real variable / of bounded 
variation as the sum of two functions that are respectively its upper 
and lower variations. Thus any function of bounded variation is 
the difference of two monotone non -decreasing functions, and con- 
sequently is measurable (33) and has at every point the two uni- 
lateral limits, on the right and left. Moreover, the set of its points 
of discontinuity is at most enumerable, since the sum of its oscil- 
lations at the points of discontinuity lying in any finite interval is 
always finite (this is actually the special case of Theorem 4.1). 

In various cases it is more convenient to operate on functions 
of a real variable than on additive functions of an interval in Ii v 
The difference is, of course, only formal, and all the definitions 
adopted for functions of an interval can be stated, with obvious 
modifications, in terms of functions of a real variable. We need 
not state them here explicitly. If F is a function of a real variable, 
of bounded variation, the meaning of expressions such as Lebes- 
gue-Stieltjes integral with respect to F, integral (F), 
sets (2 f), and so on, may be regarded as absolutely clear, in view 
of the definitions of § 5. If F is a continuous function and g 

a function integrable (F), the integral j gdF, where I is a variable 

i 

interval, is an additive continuous function of an interval I (vide 
§ 5, p. 65). There is, consequently, a continuous function of a real 
variable whose increment on any interval I coincides with the 
definite integral (F) of g over I. This function, which is determined 
uniquely except for an additive constant, is also termed indefinite 
integral (F) of g. 
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When there is no ambiguity, the additive function of an interval 
that is determined by a finite function of a real variable F, will 
be denoted by the same letter F, i. e. F(I) will stand for the in- 
crement of F(x) on an interval I. By means of the corresponding 
function of an interval, any function of a real variable F of bounded 
variation determines an additive function of a set which we denote 
by F* (cf. § 5). We see at once that F*(X)=F(b+) — F(a — ) when 
X— [a, 6], and that F*(X)=F(a+) — F(a — ) when X~(a), i. e. 
when X is the set consisting of a single point a. 

If W ( x ) is the absolute variation of a function F (x) of bounded 
variation, we clearly have W {F*-, X) <!TP*(Z) for every set X 
bounded and measurable (©). The opposite inequality does not 
hold in general. If, for instance, I is a set consisting of one point 
only, and F is the characteristic function of X, then W (F*-,X)= 
—F*{X)= 0, while W*(X)—2. We can, however, state the following 
theorem: 

(13.2) Theorem. If F(x) is a function of a real variable of bounded 
variation, and W (x) is the absolute variation of F(x), then W (F*-, X)— 
= W*(X) for every set X bounded and measurable (©) at all points 
of which j F( x) is continuous. 

Proof. Suppose first that the set X is contained in an open 
interval J 0 in which the function F (x), and consequently the function 
W (x) also, is continuous. Let G(ZJ 0 t )e an arbitrary open set such 
that X(Z G. Then, expressing G as the sum of a sequence of closed 
non-overlapping intervals {!„}, we get W*(X)^.W*(G)=£W*(I, ,)= 

n 

= whence W*(X)^W(F*;X), and 

n n 

since the opposite inequality is obvious, W*(X) = W(F*-, X). 

Let us pass now to the general case. Let I 0 be an interval 
containing X in its interior, and let e>0. Denote by {a,,} the se- 
quence of points of discontinuity of F(x) interior to I 0 , and by S N (x) 
the saltus-function of F(x) corresponding to the points a n for n>N. 
Let us put G(x)=F(x) — S N (x), where N is a positive integer suf- 
ficiently large in order that W(S N ; f 0 )O- The points a,, a if ..., a N , 
none of which belongs to X, divide 7 0 into a finite number of sub- 
intervals in the interior of which the function G(x) 

is continuous. Hence, denoting by V (x) the absolute variation of 
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G(x), there follows, by what has already been proved, V*(X-J k ) = 
=W(G*; X-Jk) for *==0, 1, ..., N } whence F*(X)=W(<?*; X). On 
the other hand, \W*(X)—V*(X)\ and |W(F*; X)— W(G*; X) are 
both at most equal to W($Ar;Z 0 Xf. Thus \W*{X) — W(F*; X)|^2e, 
and finally W*(X) = W(F*; X). 

If F(x) is a finite function of a real variable and E an arbit- 
rary set* in R v the set of the values of F(x) for peE will be denoted 
by F[E]. 

(13.3) Theorem . If F{x) is a function of a real variable of 
bounded variation and W (x) is the absolute variation of F(x ), then 
\F[E]\ ^W*(E) for every set E in and if further the function 

F(x) is non-decreasing , and continuous at all points of 2?, then 
\F[E]\ = F*(E). 

Proof. Let f be a positive number and {/„) a sequence of 
intervals such that EC 11 In and W*(E)-\-e^*ZW(I n ). Then, if m n and 

n n 

M„ denote the lower and upper bounds, respectively, of F(x) on I„, 
the sequence of intervals |[m„, M„]) covers the set F[E], and con- 
sequently \F[E]\^Z(M n — m n )^ZW(I„)^W*(E) + e. Hence, 
| F[E]\^W*(E). 

Suppose now F{x) continuous at the points of E and non- 
decreasing. By what has already been proved, \F[E]\^F*{E). To 
establish the opposite inequality, let rj be an arbitrary positive 
number, and {</„} a sequence of intervals subject to the conditions 
F[E]C_ £j tt . and \F[E]\+Tj^H\J n \. Let E n denote the set of the 

n n 

points xeE such that F(x)eJ n . Then F*(E„)^.\ J„\ for each «; and 
consequently F*(E)^Z\J„\^\F[E]\ + ri, whence F*(E)^\F[E]\. 


The characteristic function of a set consisting of a single point provides 
the simplest example of a singular function of a real variable, that does not vc.~ish 
identically. This function is however discontinuous. It is easy to give examples 
of functions of an interval that are additive, singular, continuous, and not iden- 
tically zero, in the spaces li m for 2. For simplicity, consider the plane, and 
denote, for any interval Z, by F(l) the length of the segment of the line y — x 
contained in Z; the function of an interval F(I) will evidently have the desired 
properties. A similar example for is less trivial. We shall therefore conclude 
this § with a short description of an elementary method for the construction of 
continuous singular functions of a real variable. 

We shall begin with the following remark which frequently proves useful. 
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(13.4) Let E be a linear , bounded , perfect and non-dense set , and a and p a 
two arbitrary numbers . T/tew, if a and b denote the lower and upper bounds of E , 
a function F(x) may be defined on the interval J Q — [a, b] so as to satisfy the following 
conditions: (i) F(a) — a, F(b) = /?, (ii) F(z) is constant on each interval contiguous 
to the set E, and (iii) F(x) is continuous and ywn-decreasing on the interval J 0 and 
strictly increasing on the set E. 

To see this, let {l n } be the sequence of intervals contiguous to E f and let 
us agree to write In^Im whenever the interval I n is situated on the left of l m . 
By induction (cf. e. g. F. Hausdorff [II, p. 50]) we can easily establish a one- 
to-one correspondence between the intervals l n and the rational numbers of the 
open interval (a, f$) so that, denoting by u(I n ) the number which corresponds 
to the interval I nt the relation I n ^Im implies u{I n ) u(I m ). Let us now put 
F (x)=u(In) for xel n where n— 1 , 2, ..., and then extend F(x) by continuity 
to the whole of the interval J 0 . We see at once that the function F{x) thus ob- 
tained satisfies all the required conditions (i), (ii) and (iii) of (13.4). 

Now let us choose for the set E in (13.4) a set of measure zero. Then if 
{In} is the sequence of the intervals contiguous to £*, we have 

n n 

Ik) — 0 

A- 1 

n 

for each positive integer n; and since |J 0 — £ Ik | —* 0 as n-> oo, the func- 

k—\ 

tion F(x) is evidently singular on the interval J 0 . 

The singular function obtained by the foregoing construction is continuous 
and monotone non -decreasing; the function is not constant on the whole interval J Ql 
but is so on certain partial intervals. Now, by the method of condensation 
of singularities, it is easy to derive from it a singular continuous function 
that increases everywhere. 

To do this, suppose in (13.4), \E\— 0, a=a= 0, b = p^= 1, and extend the 
function F(x) on to the whole axis R x by stipulating F(x + 1)= l-fF(jr). Write 

<>«> 

This series is a uniformly convergent series of singular functions, since F(ns) is 
clearly singular with F(x ). Now the functions F(nx) are monotone non -decreasing. 
By Theorem 12.1 (3°), the function H(x) is thus singular. This function is also con- 
tinuous, as the limit of a uniformly convergent series. To prove that H(x) is strictly 
increasing, let x x and x % >x x denote an arbitrary pair of points in [0, 1]. For 
n> l/(x t — JCj), we have nx t — imp, >1, and consequently F(nx t )> F(nx x ); 
while for every n, F(nx % )^F(nx x ), whence by (13.5), H(x t )> H(x x ) as asserted. 

Various examples of this kind have been constructed by A. Den joy [l] t 
W. Sierpihski [3], H. Hahn [I, p. 538], L. C. Young [1] and G. Vitali [4]; 
cf. also O. D. Kellog [1], and E. Hille and J. D. Tamarkin [1], 
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§ 14. Integration by parts. As in the preceding §, we shall 
deal only in this § with functions of a real variable. For the latter, 
we shall establish two classical theorems, of importance on account 
of their many applications to various branches of Analysis. We shall 
first prove them for the Lebesgue-Stieltjes integral and then spe- 
cialize them for the ordinary Lebesgue integral. 

(14.1) Theorem on integration by parts. If U(x) and V(x) 
are two functions of bounded variation, we have for every interval 

J 0 = LMJ 

b b 

[UdV + f VdU=U(b+)V(b+)—U(a—)V(a—), 

n n 

provided that at each point of I 0 either one at least of the functions 
U and V is continuous, or both are regular. 

Proof. In order to simplify the notation assume «=() and 

b — 1 , and consider the triangle Q= E [0 ^ 1 ; y on the plane JB 2 . 

U. y) 

The set E[(a?, y)eQ] is then the interval [y, 1] or the empty set, 

x 

according as y belongs, or does not belong, to the interval [0, 1]. 

Similarly, E [(.r, y)eQ] is the interval [0, .r] or the empty set, 
y 

according as we have, or do not have, Hence, by Fubini’s 

theorem in the form (8.6), 

i i 

/ [ U(l + )— U(y — )] dV(y) = /[ V(x + ) — F(0— )] dU(x), 

6 o 

l. e. 

i i 

(14.2) fu(x-)dV(x)+ j'V(x+)dU(x)=U(l+)V(l + )-U(0-)V(0-). 

o 6 

Interchanging U and V and adding the corresponding equation 
to (14.2), we get, on dividing by 2, 

1 i 

(14.3) [ i[U(x+) + U(x—)]dV(x) + f \[V(x + )+V(x—)]dU(x) = 

6 o 

= U( 1 + ) V( 1 + ) ■ — 17(0— ) F(0— ). 

Let M be the set of the points in / 0 =[0, 1] at which the 
function U (x) is regular. Then 

(14.4) J \tU(x+)+V(x—)]dV(x)= f U(x)dV(x). 
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On the other hand, the set I 0 — M is at most enumerable and, by 
hypothesis, the function V (x), and consequently both its relative 
variations, are continuous at each point of I 0 —M. Thus, the def- 
inite integral ( V) of any function over the set 1 0 — M is zero, and 

1 i 

it follows from (14.4) that f \[U(x-\-)-\-U{x — )]dT(®)= J U(x) dV(x). 

o o 

Similarly, the second member on the left-hand side of the rela- 
tion (14.3) is equal to J V(x)dU(x), and this relation may be written 

o 

i i 

UdV + f VdU=U( 1 + ) F(l+) — 17(0— ) F(0— ), 

0 

which proves the theorem. 

The theorem may be also proved independently of Fubini’s theorem, but 
then the proof is slightly longer. The proof given abo\e was communicated to 
the author by L. C. Young. 


/ 


(14.5) Second Mean Value Theorem. If U(x) and V{x) are 
two non-decreasing functions and the function V (x) is continuous f 
then in any interval [a, b ] there exists a point £ such that 

b 

(14.6) f U dV=U(a)-[V(£) -V(a)]+U(b)-[V(b)—V(£)]. 

a 

Proof. Since the values of U(x) outside the interval [a, 6] 
do not affect (14.6), we may suppose that U(a — )=U (a) and, 
U (b-)-)=U (b). Therefore, making use of Theorem 14.1 and of the 
first mean value theorem (Chap. I, Th. 11.13), we obtain 

b b 

(14.7) f UdV=V(b)V(b) — U(a)V(a) — f VdU= 

a (i 

= U(b) V(b) —U(a) V(a ) — n • [ U(b) — 17(a)], 

where n is a number lying between the bounds of the function V (x) 
on [a, 6]. But, since this function is by hypothesis continuous, there 
exists in [a, i>] a point £ such that V (£). Substituting this value 
for (i in (14.7) we obtain the relation (14.61. 
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As a special case of Theorem 14.1 we have the following theorem 
on integration by parts for the Lebesgue integral: 

(14.8) Theorem. If u(x) and v(x) are two summable functions 
on an interval [a, 6] and U ( x ) and V (x) are their indefinite in- 
tegrals (L), then 

b b 

(14.9) f U(x)v{x)dx+ f V(x)u(x)dx=U(b)-V(b)—U(a)V(a). 

b a 

Proof. Observe first that by writing for instance u(x) = 0 
and v(x) = 0 outside the interval [a, 6], we may suppose that the 
functions u(x) and v(x ), and their indefinite integrals U (x) and V ( x ), 
are defined on the whole straight line ll x . Also, by altering, if neces- 
sary, the values of the functions u{x) and v(x) on a set of measure 
(L) zero, which does not affect the values of the integrals in (14.9), 
we may suppose that these functions, together with the functions 
U(x)v(x) and V(x)u(x), are measurable (*8) (cf. Theorem 7.6 
of Vitali-Carath<k>dory or else Lusin’s Theorem 7.1). We may, 
therefore write, according to Theorem 15.1, Chap. I, 

b b b b 

J V (x)v(x)dx= j U(x)dV(x) and J V(x)u(x)dx = IV(x)dU(x), 

a a a a 

and (14.9) follows at once from Theorem 14.1. 

Similarly, we derive at once from Theorem 14.5 the second 
mean value theorem for the Lebesgue integral: 

(14.10) Theorem. If U (x) is a non- decreasing function on an 
interval [a, b ] and v(x) is a summable function on this interval , then 

b # b 

j V(x) v(x)dx = U(a) j v(x) dx + U (b) Jv(x)dx , 

a a £ 


where £ is a point of [a, b]. 
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Derivation of additive functions of a set 
and of an interval. 

§ 1. Introduction. In this chapter we shall study Lebesgue’s 
theory of derivation of additive functions in a Euclidean space of 
any number of dimensions. When other spaces are considered, or 
when we specialize our space (to be, say, the straight line R t or 
the plane R 2 ), we shall say so explicitly. 

In what follows, an essential part is played by Vitali’s Covering 
Theorem (vide, below, § 3) which is restricted to the case of Lebes- 
gue measure. For this reason, the theorems of the present chapter 
have not in general any complete or direct extension to other meas- 
ures, not even when the latter are determined by additive functions 
of an interval. In accordance with the conventions of § 5, Chap. Ill, 
the terms measure, integral, almost everywhere, etc. will 
be understood in the Lebesgue sense whenever we do not explicitly 
assign another meaning to them. Similarly, by additive functions 
of a set we shall always mean functions of a set (53) (some of which 
may of course be continued on to wider classes of sets, cf. Chap. Ill, §5). 

We have already remarked in § 1, Chap. I, that any additive 
function of a set <P in a space R m , may be regarded as a distribution 
of mass. It is then natural to consider the limit of the ratio 0(8)l\8\ 
where 8 denotes a cube, or a sphere, with a fixed centre a and with 
diameter tending to 0, as the density of the mass at the point a. 
By the fundamental theorem of Lebesgue (vide, below, Theorem 5.4) 
this limit exists almost everywhere. Moreover Lebesgue has shown 
that in the above ratio, S may be taken to denote much more gene- 
ral sets than cubes or spheres. Of these, further details will be 
given in the next $. 
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§ 2. Derivates of functions of a set and of an interval* 

Suppose given a Euclidean space By parameter of regularity r (E) 
of a set E lying in this space, we shall mean the lower bound of 
the numbers \E\j\J\ where J denotes any cube containing E . Thus 
when E is an interval, V n fL m ^.r(E)^l / L, where l denotes the 
smallest and L the largest of the edges of J57; in particular, the 
parameter of regularity of a cube is equal to 1. 

A sequence of sets {E n } will be termed regular , if there exists 
a positive number a such that r(2?„)>a for n~ 1, 2, ... . 

W e shall say that a sequence of sets { E „} tends to a point a, if E tl )~>0 
as n->oo ? and the point a belongs to all the sets of the sequence. 

Given a function of a set 0 (not necessarily additive) we call 
general upper derivate of 0 at a point a the upper bound of the 
numbers l such that there exists a regular sequence of closed sets {E n } 
tending to a, for which lim 0( E n )/\E n \ = 1. We shall denote this 

n 

derivate by D 0(a). Similarly, merely replacing the closed sets 
by intervals, we define the ordinary upper derivate of 0 at a 
point a, and we denote it by 0(a). If we remove the condition 
of regularity of the sequences of intervals considered, we obtain 
the definition of strong upper derivate. In other words the 
strong upper derivate of 0 at a point a is the upper limit of the 
ratio 0(I)l\I\, where / is any interval containing a, whose diameter 
tends to zero. This derivate will be denoted by 0±(a). 

The three lower derivates at a point a, general D(P(a), ordinary 
<P(a) y and strong 0 s (a ), have corresponding definitions, and if at 
a point a the numbers D0(a) and D0(a) are equal, their common 
value is termed general derivative of 0 at the point a and will be 
denoted by D 0(a). If further D#(a)-J=oo, the function (Pis said 
to be derivable in the general sense at the point a. Similarly we define 
the ordinary derivative P'(a) and the strong derivative 0 8 (a ), as well 
as derivability in the ordinary sense , and in the strong sense , of the 
function 0 at the point a. Sometimes the derivatives D P (a), 0'(a) 
and 0 s (a) are termed unique derivates, while the upper and lower 
derivates (general, ordinary and strong) are termed extreme derivates. 
At any point a, we clearly have D0(a)^i0(a)^ : 0(a)^ l D0(a) 
and similarly 0%(a) ^0(a) ^ 0(a) ^ <P s (a); so that the existence 
either of a general derivative, or of a strong derivative, always 
implies that of an ordinary derivative. On the other hand, no such 
relation holds between the general and the strong extreme derivates. 
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It may be noted that in order that it be possible to determine 
the general derivates of a function of a set the latter must be 
defined at any rate for all closed sets; whereas in order to determine 
the extreme ordinary derivates, or the extreme strong derivates, 
we need only have the function defined for the intervals. This 
is why the process of general derivation is most frequently applied 
to additive functions of a set, and that of ordinary, or of strong, 
derivation to functions of an interval. We shall often omit the terms 
“ordinary”, “in the ordinary sense”, in expressions such as “ordinary 
derivate”, “derivability in the ordinary sense”. 

We have seen (Chap. Ill, § 5) that an additive function of 
an interval F of bounded variation determines an additive function 
of a set F*. Let us mention, in the case in which the function F 
is non-negative, an almost evident relation between ordinary deri- 
vates of F and general derivates of F*: 

(2.1) Theorem . If F is an additive non-negative function of an inter - 
valj then , at any point x , which belongs to no hyperplane of discontinuity 
of F , we have D F*(x)^F(x)^F(x)^DF*(x). 

In particular therefore , the ordinary derivative F'(x)^DF*(x) 
exists at almost every point x at which F * has a general derivative . 

Proof. Since F(I)^.F*(I) for any interval I, the inequality 
F(x)^HF*(x) is obvious. On the other hand, let l denote any number 
exceeding F{x). Then there exists a regular sequence of intervals {I n } 
tending to the point x and such that lim F(I n )l\I n \<l. Since x does 

n 

not belong to any plane of discontinuity of F, we may asssume 
that it is an internal point of all the intervals Z„. Hence we can 
make correspond to each interval Z„ an interval J„C.Ih such 
that xeJ„, |J„|^(1 — l/n)-|Z„| and r(J„) = r(Z„). We then have 
lim m\> F*(J n )l\J„\^ilimF (I n )l\I n \<l- Now since {J„) is a regular 

n n 

sequence of intervals tending to x , it follows that DF*(x)^l, and 
therefore that DF*(x)^F(x). 

Let us note also the following result: 

(2.2) Theorem. If f is a summable function and '!> is the indefinite 
integral of f, then D0(*)^/(a?) and $»(x) ^f(x) at any point x at which 
the function f is upper semi-continuous, and similarly I)$(x)^f{x) and 
Q.(x)^f (x) at any point x at which the function f is lower semi-continuous. 

In particular therefore, 0'(x)— 0' a (x) — D$(x)~ f(x) at any 
point x at which the function f is continuous. 
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In i? i there is no difference between ordinary and strong deri- 
vation. If F (x) is a finite function of a real variable, we understand 
by its extreme derivates F(x) and F(x), and by its derivative, or unique 
derivate, F'(x), the corresponding derivates of the function of an 
interval that F(x) determines (cf. Chap. Ill, § 13). Besides these 
derivates, which we shall often term bilateral, we also define, for 
functions of a real variable, unilateral derivatives and derivates. 
Thus, iiF(x) is a finite function of a real variable defined in the neigh- 
bourhood of a point x 0 , the upper limit of [ F(x ) — F(x 0 )]/(x — x 0 ) 
as x tends to x 0 by values of x '> x 0 is called right-hand upper derivate 
of the function F at the point x 0 and is denoted by F + (x 0 ). Similarly 
we define at the point x 0 the right-hand lower derivate F + (x 0 ) and 
the two left-hand, upper and lower, derivates, F (,r 0 ) and F (x 0 ). 
These four derivates are called unilateral extreme, or Dini, derivates. 
If the two derivates on one side (right or left) are equal, their 
common value is called unilateral ( right-hand or left-hand) derivative 
of the function F at the point in question. Finally, we shall call inter- 
mediate derivate of F(x) at the point x 0 , any number l such that 
there exists a sequence {x n } of points distinct from x 0 for which 
lim x n = x Q and lim [F(x„) — F(x 0 )]j(x n — x 0 ) — l. 

n n 

Let E be a linear set, x 0 a point of accumulation of E, and 
F(x) a finite function defined on E and at the point x 0 . The upper 
and lower limit of the ratio [F(ic) — F{x 0 )]l(x — x 0 ) as a; tends to x 0 
by values belonging to the set E, are called respectively the upper 
and lower derivate of F at x 0 , relative to the set E. We shall denote them 
respectively by F E (x 0 ) and F E (x 0 ). When they are equal, their 
common value is termed derivative of F at x 0 relative to the set E, 
and is denoted by F' E (x 0 ). 

Besides this derivation relative to a set, we define also 
derivation relative to a function. Suppose given two finite 
functions F(x) and V (x), and let x 0 be a point such that the func- 
tion V is not identically constant in any interval containing x 0 . 
We then call upper derivate Fu(x 0 ) and lower derivate Fu(x 0 ) of the 
function F with respect to the function V at the point x 0 , the upper 
limit and the lower limit of the ratio [F(<r) — F (*„)]/[ U(x) — U(x 0 )] 
as x tends to x 0 by values other than those for which F{x) — F(x 0 )= 
— TJ(x) — U(x 0 )=0. Similarly, considering unilateral limits of the 
same ratio, we define four Dini derivates of F with respect to U: 
F^(x), EZ(x), F^(x) and F^x). 
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When all these extreme derivates are equal, their common 
value is denoted by F'u(x 0 ) and called derivative of F with respect 
to the function U at the point x 0 . The most usual case in which this 
method of derivation is applied, is when U is a monotone increasing 
function; and it is then easy, by change of variable, to reduce deri- 
vation with respect to U to ordinary derivation. 

§ 3. Vitali’s Covering Theorem. We shall say that a family (E 
of sets covers a set A in the sense of Vitali , if for every point x of A 
there exists a regular sequence of sets ((£) tending to x (cf. p. 106). 

(3.1) Vitali 9 8 Covering Theorem . If in the space M m a family 
of closed sets (£ covers in the sense of Vitali a set A, then there exists 
in (£ a finite or enumerable sequence {E„} of sets no two of which have 
common points , such that 

(3.2) \A~yE n \=.0. 

n 

Proof, a) We first prove the theorem in the special case in 
which (i) the parameters of regularity of all the sets ((£) exceed 
a fixed number a>0 and (ii) the set A is bounded i. e. contained 
in an open sphere S. We may clearly assume that, in addition, all 
the sets ((£) are also contained in S. 

This being so, we shall define the required sequence { E » ) by 
induction in the following manner. 

For E x we choose an arbitrary set ((£), and when the first p 
sets E? lt E 2 , ... , E p no two of which have common points, have 
been defined, we denote by S p the upper bound of the diameters 

p 1 

of all the sets ((£) which have no points in common with £E iy and 

i— i 

by E p+ 1 any one of these sets with diameter exceeding d p /2. Such 
a set must exist, unless the sets E v E. „ ... , E p already cover the 
whole of the set A, in which case they constitute the sequence whose 
existence was to be established. We may therefore suppose that 
this induction can be continued indefinitely. 

To show that the infinite sequence {E„} thus defined covers A 
almost entirely, let us write 

(3.3) B=A—SEn 

■ 

and suppose, if possible, that |B|>0. On account of condition (i), we 
can associate with each set E„ a cube J„ such that EnQJn and 
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|J£ n |>a-|<7n|. Let J„ denote the cube with the same centre as J n 
and with diameter (4m+l) 6(J„). The series 

(3.4) 2’|J„|=(4m+l) m .2'|J n K(4OT+ir.a- 1 -2\E„\*Z (4m+l) m -a~ 1 

n n n 

00 ^ 

converges; therefore we can find an integer N such that V \J„\ <|f?|. 

n.-V+l 

/*V/ 

It follows that there exists a point x 0 eB not belonging to any J„ 
for n > N ; and since by (3.3) the point x Q does not belong to £E n 

n 

and the sets E„ are supposed closed, there must exist in C a set E 
containing x 0 and such that 

(3.5) E ■ En = 0 for n = 1, 2, ... , N. 

Hence the set E has common points with at least one of the sets E n 
for n>N; for otherwise we should have 0 < <5(2?)^d„^ 2d(E n +i)^. 
^ 2d(J n+{ ) for every positive integer «, and this is clearly impossible 
since by (3.4) we have lim 3(J„)= 0. Let n 0 be the smallest of the 

n 

values of n for which E E„^ti. Then on the one hand, E E n = 0 
for n = l, 2, ...,n 0 — 1, so that 

(3.6) <*(£)<< 

and on the other hand, by (3.5), rc 0 >iV, which implies, by definition 
of j? 0 , that does not belong to Thus we find that there are 
both some points outside J n „ and some points belonging to the set 
En o C J n„, which are contained in the set E\ this set must therefore 
have diameter exceeding 2<5(J„ o )^ 2<5 (£„„)> <5 „ 0 -i, in contradiction 
to (3.6). The assumption that \B\ > 0 thus leads to a contradiction 
and this proves the theorem, subject to the additional hypotheses 
(i) and (ii). 

b) Now let (£ be any family of closed sets covering the set A 
in the sense of Vitali; and let us denote, for any positive integer n, 
by S n the sphere S(0; n) and by A„ the set of the points 
for which there exists a sequence of sets tending to x and consisting 
of sets (d) whose parameters of regularity exceed 1 /«. The sets A„ 
constitute an ascending sequence and A=lim.4„. 

n 

We can now define by induction a sequence of families of 
sets \Z n ) subject to the following conditions: 1° each family Z n 
consists of a finite number of sets ((£) no two of which have common 
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points and none of which, for n> 1, have points in common with 
the sets of preceding families $ 2 , ... , 5„_i; 2° denoting by T n 
the sum of the sets which belong to Z„, 

n 

(3.7) \A„—ET,\<lln. 

<= i 

To see this, suppose that for n-^p we have determined families T„ 

~ A 

subject to 1° and 2°. Write A p +\ = A p+i — 2jT), and consider the 

<=i p 

family of all the sets ((£) which are contained in the open set C £T t and 

whose parameters of regularity exceed l/(p + .l). This family evid- 
ently covers the set A p+ i C -Ap+i C S p +i in the sense of Vitali, and 
by what we have already proved we can extract from it a sequence ( Et } 

/"V/ 

of sets, no two of which have common points, so as to cover ^. p +i 
almost entirely. Therefore, for a sufficiently large index i 0 , 

Mp-h -iTi- ZEiHAp+v -tn < l/(p + l), 

I 1 1—1 /= 1 

and, denoting by T p+ i the family consisting of the sets E lf E z , ... , E i(l , 
we find that conditions 1° and 2° hold when n — p-\-l. 

Let us now write The family T consists of a finite 

n 

number, or of an enumerable infinity, of sets ((£) no two of which 
have common points. Denoting the sum of these sets by T, we find, 
by (3.7), I-.1 — T| = 0 and this completes the proof. 

The proof given above ia due to S. Banach [3] (for other proofs cf. C. Ca- 
ratheodory f II, pp. 299 — 307] and T. Radb [3]). 

Theorem 3.1 was proved by G. Vitali [3] in a slightly less 
general form; he assumed the family (£ to consist of cubes. H. Le- 
besgue [5] while retaining the line of argument of Vitali, showed 
that the conclusion drawn by Vitali could be generalized as follows: 

(3.8) Theorem . Given a set A and a family (£ of closed sets, suppose 
that with each point xeA we can associate a number a>0, a sequence 
{X,,} of sets ((£) and a sequence {J n } of cubes such that 

xeJ„, X„C.Jn, |X,| > a • | J„| for n— 1 , 2 ,..., and lim3|J n ) = 0 . 

n 

Then '£ contains a sequence of sets no two of which have common 
points , that covers the set A almost entirely . 
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This statement, although apparently more general than 
that of Theorem 3.1, easily reduces to the latter. For let us denote 
by (E° the family of all the sets of the form E+(x), where E is any 
set of <E and x any point of the set A. The family (E° clearly covers 
the set A in the Vitali sense, and by Theorem 3.1 we can therefore 
extract from it a sequence {E%} of sets no two of which have common 
points dnd which covers the set A almost entirely. Now each set En 
either already belongs to (E, or becomes a set ((E) as soon as we 
remove from it a suitably chosen point. Therefore, removing where 
necessary a point from each K '', , we obtain a sequence of sets (C£) no 
two of which have common points and whose sum, since it differs from 

by an at most enumerable set, covers A almost entirely. 

n 

For farther generalizations of Vitali’s theorem (which, again, can be proved 
without introducing fresh methods), see B. Jessen,. J. Marcinkiewicz and 
A. Zygmund [1, p. 224]. 

It is easy to see that the hypothesis that the family (E covers the set A 
in the Vitali sense (and not merely in the ordinary sense) is essential for the 
validity of Theorem 3.1. But, as has been shown by S. Banach [1] and H. Bohr 
(vide C. Carath^odory [II, p. 689]), this hypothesis cannot be dispensed with 
in the theorem even in the case where (S is a family of intervals such that to each 
point x of the set J there corresponds a sequence {I n ) of intervals belonging to (E, 
of centre x and diameter tending to zero as n— >oo. 

For covering theorems similar to that of Vitali and which correspond to 
linear measure (length, cf. Chap. II, § 9) of sets, vide W. Sierpinski [7], 
A. S. Besicovitch [1] and J. Gillis [1]. 

# 4. Theorems on measurability of derivates. Of the 

two theorems which we shall establish in this §, the first is due to 
8. Banach [4, p. 174] (cf. also A. J. Ward [2, p. 177]) and concerns 
the extreme derivates of any function of an interval (not necessarily 
additive). We begin by proving the following lemma: 

(4.1) Lemma . Any set expressible as the sum of a family of intervals 
is measurable . 

Proof. Let 3 be any family of intervals and let 8 be the sum 
of the intervals of 3 - Let (S denote the family of cubes each of which 
is contained in one at least of the intervals ( 3 ). The set 8 is clearly 
covered by (£ in the Vitali sense and by Theorem 3.1, 8 is therefore 
expressible as the sum of a sequence of cubes ((£) and of a set of 
measure zero. Therefore the set S is measurable, as asserted. 

(4.2) Theorem . IfFis a function of am interval^its two extreme ordinary 
derivates F and F and its extreme strong derivates F* and F B are measurable. 
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Proof. Let us take first the strong derivates of F , say F s . 
Let a be a finite number and P the set of the points x for which 
F s (x)'>a, For any pair of positive integers h and k , let us denote 
by P h,h the sum of all the intervals I for which <5(/)^l/fc and 
P(/)//|^a + l//i. We see at once that P=2/77 P/i,a- Now the sets 

h k 

P h x are measurable on account of Lemma 4.1 and so is the set P. 
This proves F s (x) to be a measurable function. 

Consider next the ordinary upper derivate F. As before let a 
be any finite number, and Q the set of the points x at which F(x)> a. 
In order that a point x should belong to Q, it is clearly neces- 
sary and sufficient that there should exist a positive number a 
and a sequence (Z„} of intervals tending to x such that r {I n )^a 
and F(I n )l\In\^ a + a for w?==l, 2, ... . Hence denoting for any pair 
of positive integers h, k by $/,,* the sum of the intervals / such that 
r(Z) ^ 1//*, d(I) ^ IJk and F(Z)/|7| ^ a-f-l/A, we find easily that 
Thus, since each set Q lhk is measurable by Lemma 4.1, 

hk 

so is also the set Q. The derivate F(x) is therefore measurable. 

It follows in particular from Theorem 4.2 that the bilateral 
extreme derivates of any function of a real variable are measurable. 
The same is not true of unilateral extreme derivates. Neverthe- 
less, as shown by S. Banach [2] (cf. also II. Auerbach [1]), these 
derivates are measurable whenever the given function is so. Similarly 
by a theorem of W. Sierpiriski [8], the Dini derivates of a function 
measurable (B) are themselves measurable (B). These two results 
are included in the following proposition, from which they are 
obtained by choosing the class 96 to be either £ or B. 

(4.3) Theorem . If 96 is an additive class of sets in which includes the 
sets measurable (B), the I)ini derivates of any function of a real variable 
which is finite and measurable ( 96 ), arc themselves measurable ( 96 ). 

Proof. If F is any finite function of a real variable, x any 
point and /?, k any pair of positive integers subject to A >/?, let us 
write D, ltk (F'jX) for the upper bound of the ratio [F(t) — F(x)] (t — x) 
when x + l/k <t<Zx{ 1/7/ . At any point ./• we clearly have 

(4.4) F+(x) — lim lim /)/,.* (F: x). 

h k 

Now let a be any finite number and consider the set 

( 4 . 5 ) E[Df,,k(F] x) > a]. 
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We see at once that if the function F is constant on a set E, the 
set of the points x of E at which Dh,k(F-, x)>a is open in E (cf. 
Chap. II, p. 41). Thus the set (4.5), and consequently the expression 
Eh,k{F-,x) as function of x, is measurable (SE) whenever the function F 
is finite, measurable (St) and assumes at most an enumerable infinity 
of distinct values. 

This being so, let F be any finite function measurable (SE). 
We can represent it as the limit of a uniformly convergent sequence 
{F„} of functions measurable (SE) each of which assumes at most 
an enumerable i nfini ty of distinct values: for instance we may write 
F„{x)=ijn, when i/n^F(x)<(i+l)/n for i=... — 2, — 1,0, 1,2,.... 
We then have Dh,k(F-,x) — 1imBh,k(Fn',x), and since by the above 

n 

the functions Dh,k(F n -,x) are measurable (9E) in #, so is D hyk {F\x). 
It follows at once from (4.4) that the derivate F+(x) is also meas- 
urable (SC), and this completes the proof. 

§ 5. Lebesgue’s Theorem. We shall establish in this § the 
fundamental theorem, of Lebesgue on derivation of additive functions 
of a set and of additive functions of bounded variation of an interval. 

(5.1) Lemma . If for a non-negative additive function of a set 0 the 
inequality D &(x)^a holds at every point x of a set A, then 

(5.2) 0(X)^a-\A\ 

holds for every set X"Z)A, bounded and measurable (33). 

Proof. Let e be any positive number and b any number less 
than a. By Theorems 6.5 and 6.10 of Chap. Ill, there exists a bounded 
open set G such that 

(5.3) XC 0 and 0(X)^0(G) — e. 

Let us denote by <£ the family of closed _sets E(ZG for each 
of which 0(E)^b-\E\. Since by liyj)othesis, D 0(x)'^a'>b at any 
point xt-A, the family (£ covers the set A in the Vitali sense, and 
by Theorem 3.1, we can extract from it a sequence {E,,} of sets no 
two of which have common points, so as to cover almost entirely 
the set A. Therefore, on account of (5.3), 

<P(Z)> 0«7) — <V(E„) — t ^b Z\E„\ — f^b-\A\ — e. 

n n 

In this we make e~>() and b > a, and (5.2) then follows at once. 
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(5.4) Lebesgue'a Theorem. An additive function of a set is almost 
everywhere derivable in the general sense. An additive function of 
bounded variation of an interval is almost everywhere derivable in 
the ordinary sense. 

Proof. On account of Theorem 2.1 we may restrict ourselves 
to non-negative additive functions of a set. 

Let 0 be such a function and suppose that D<P(a?)>D<P(<r) 
holds at each point a; of a set A of positive measure. For any pair 
of positive integers h and k let us denote by .4*,* the set of the points 
x of A for which D4>{x)>(h+l)lk>hlk'>T>0{x). We have A=£Ah,k, 

h, A 

and therefore there exists a pair of integers h 0 and k 0 such that 
> 0. Let us denote by B any bounded subset of Ah„x of 
positive outer measure. Let e be any positive number and 0 
a bounded open set such that 

(5.5) BQO and |G|<|B| + f. 

Consider the family of all closed sets EQG for which 4>(E)<i^-\E\. 

This family covers the set B in the Vitali sense, and therefore con- 
tains a sequence { E„ ) of sets no two of which have common points, 
which covers the set B almost entirely. Writing Q =X we find, on 

n 

account of (5.5), 

( 5 . 6 ) m)=z 

n Kq n fC o ^0 

On the other hand, D<Z>(x)^ (h 0 +l)/k 0 at each point xeB. Therefore, 
since all but a subset of measure zero of the set B is contained in the 

set Q, it follows from Lemma 5.1 that <b(Q)^^—^--\B\, and therefore, 

Kq 

on account of (5.6) that (ft 0 -)-l)-|B|^/i 0 '(|J5| + f). But this is clearly 
a contradiction since |JB| > 0 and e is an arbitrary positive number. 

Thus the function has almost everywhere a general derivative 
D<P. It remains to prove that the latter is almost everywhere finite. 
Suppose the contrary: there would then exist a sphere S such that 
D<P(a:)=-f-oo at any point a: of a subset of S of positive measure. 
We should then have, by Lemma 5.1, $(S)=- |-oo, which is impos- 
sible and completes the proof. 
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The preceding theorem was proved by H. Lebesgue [I, p. 128] first 
for continuous functions of a real variable and later [5, p. 408 — 425] for additive 
functions of a set in R n . Among the many memoirs devoted to simplifying the 
proof we may mention: G. Faber [1], W. H. and G. C. Young [1], H. Stein- 
haus [1J, Ch. .1. de la Vallee Poussin |T, I; p. 103], A. Rajchman and 
S. Saks [1], J. Bidder [3] (cf. also the direct proof of Lebesgue’s theorem for 
additive functions of bounded variation of an interval in the first edition of this 
book). More recently F. Riesz [6; 7] has given an elegant proof of Lebesgue’s 
theorem for functions of a real variable. Finally S. Banach [4, p. 177] has exten- 
ded the theorem in question to a class of functions of an interval which is slightly 
wider than that of additive functions of bounded variation. The proof given 
here applies also without any essential modification to the theorem of Banach. 

For an extension of the theorem to certain abstract spaces, see J. A. Clark- 
son [1] (cf. also S. Bochner [3]) 

Another application of Lemma 5.1 is the proof of the fol- 
lowing theorem on term by term derivation of monotone sequences 
of additive functions: 

(5.7 ) Theorem • If an additive function of a set is the limit of a mono- 
tone sequence {#„} of additive functions of a set , then almost everywhere 
D = lim D <P n (x). 

n 

In the same way , if an additive function of bounded variation 
of an interval , F, is the limit of a monotone sequence { F n } of additive 
functions of bounded variation of an interval , then almost everywhere 
F'(x) = lim Fn(x). 

n 

Proof. Suppose, to fix the ideas, that the sequence {<P„} is 
non-decreasing and write To establish the first part 

of the theorem, we need only show that 

(5.8) limD@„(a?) = 0 

11 

almost everywhere. 

For this purpose, let A denote the set of the points x at which 

(5.8) is not satisfied and suppose that |A| > 0. For any positive inte- 
ger k we write .4* for the set of the points x at which limD^„(a;)^l/A:. 

11 

Since A=£Ai,, there is an index k — k 0 such that 0. Let B de- 

ft 

note any hounded subset of A*„ of positive measure and I an interval 
containing B. Since the sequence {£„} is non-increasing, so is the 
sequence {D®,.} and therefore we must have D8 n (r)^ l/fc 0 for 
n — 1,2, ... at any point xe BQ .4 * . Hence by Lemma 5.1, we find 
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S„(I) ^ \B\lk 0 for every positive integer n, and this is dearly a con- 
tradiction since |J5| > 0 and lim S„(I) = 0(1) — lini $,,(I) — 0. 

n n 

We might prove similarly the second part of the theorem, 
but actually the latter can be reduced at once to the first part. 
In fact if we suppose the given sequence (jPn) non-decreasing and 
write Tn~F — F n , then the functions of an interval, T„, are non- 
negative and the sequence { T n (I ) } converges to 0 in a non-increasing 
manner for every interval I. The sequence [T*) of additive and 
non-negative functions of a set is then also non-increasing and con- 
verges to 0 (cf. Chap. Ill, Theorem 6.2). Hence, by the first part of 
our theorem and by Theorem 2.1, we have lim Tn(%)=limDTn(x)=Q v 

n n 

and therefore lim F' n (x)=F'(x), for almost all x, which completes 

n 

the proof. 

For functions of a real variable. Theorem 5.7 may be stated in the fol- 
lowing form ( vide G. Fubini [2]; cf. also L. Tonelli [3] and F. Riesz [6; 7]): 

If F(x)=^F n {x) is a convergent series of monotone non-decreasing functions , 

n 

then the relation F'(x)= yFn(x) holds almost everywhere. 


§ 6. Derivation of the indefinite integral. Given a set A, 
let us write La(X) — \A-X\ for every measurable set X. The function 
L a of a measurable set, thus defined, is termed measure-function 
for the set A. Considered as function of a measurable set, or as 
function of an interval, La is additive and absolutely continuous; 
and, if further the set A is measurable, we have La(X) = ^c A (x)dx 

for every measurable set X, i. e. the function L A is the indefinite 
integral of the characteristic function of the set A. 

(6.1) Theorem. For any set A we have 

(6.2) DL^(a?) = q a {x) 

at almost all x of A; and if further , the set A is measurable , then (6.2) 
holds almost everywhere in the whole space. 

Proof. By Theorem 6.7 of Chap. Ill the set A can be enclosed 
in a set H e © 4 for which the measure-function is the same as for 
the set A. Let us write E—fl G n , where {O n } is a descending 
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sequence of open sets. We clearly have DL^ n (a;)=l at any point xeOn, 

and so a fortiori at any point x e A C H C Hence, remembering 

that the sequence {L^} of functions of a set is non-increasing and 

converges to the function Lu= L w, it follows by Theorem 5.7 that 
DL,*(a;) — lim DL^ (x) = 1 = c^#) almost everywhere in A. 

rt n 

Now suppose the set A measurable. Then Im(X) -f Lca(X) = |2T| 
for every measurable set X, and consequently DL^(») + DLc^(a?)=l 
at any point x at which the two derivatives Dh A {x) and DLca(*) 
exist, i. e. almost everywhere by Lebesgue’s Theorem 5.4. Now, by 
what has been proved already, DLca{x) = 1 almost everywhere 
in CA. Therefore DL^(x) = 0 — 0 , 1 ( 0 ;) at almost all a; of C A and 
this shows that (6.2) holds almost everywhere in the whole space. 

(6.3) Theorem. If <I> is the indefinite integral of a summahle 
function f, then 

(6.4) T)Q(x) — f(x) 

at almost all 'points x of the space. 

Proof. We may clearly assume that / is a non-negative function. 
If / is the characteristic function of a measurable set, the relation (6.4) 
holds almost everywhere by the second part of Theorem 6.1. The rela- 
tion therefore remains valid when / is a finite simple function, i. e. 
the bnear combination of a finite number of characteristic functions. 
Finally, in the general case, any non-negative summable function / 
is the limit of a non-decreasing sequence {f,,} of finite simple measur- 
able functions; therefore, denoting by 0„ the indefinite integral of 
it follows from Theorem 5.7 that IHP(j?) = lim I)$„(x) — lim fr,(x) — f(x) 

II ll 

almost everywhere, and this completes the proof. 


§7. The Lebesgue decomposition. In this § we shall give 
for additive functions of a set ('13), a more precise form to the 
Lebesgue Decomposition Theorem 14.6, Chap. I. We shall prove in 
fact that the absolutely continuous function which occurs in this 
theorem is the indefinite integral of the general derivative of the 
given function. At the same time we shall establish the corresponding 
decomposition- for additive functions 0 ! bounded variation. 
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(7.1) Lemma. If 0 is a singular additive function of a set , then 
D0(x) = 0 almost everywhere. 

Proof. We may assume (cf. Chap. I, Theorem 13.1 (1°)) that 
the function 0 is non-negative. 

The function 0 being singular, there exists a set E 0 measur- 
able (8) and of measure zero, such that 

(7.2) 0(X-CE o ) = O for every set X bounded and measurable ("8). 

Suppose that the set of the points x at which D<P(#)>0 has 
positive measure. Then denoting by Q„ the set of the points x e CE 0 
at which D#(a;)> 1 /to, there exists a positive integer X such that 
\Qn\ > 0. Consequently, there also exists an interval I such that 
\I ■ Qy\ > 0, and by Lemma 5.1 we find 0(1 • CE 0 )^ 0(1 -Qn)^ 
^ \I ■ Qn\/N> 0, which clearly contradicts (7.2). 

(7.3) Theorem. If 0 is an additive function of a set, the derivative 
D0 is summable, and the function 0 is expressible as the sum of its 
function of singularities and of the indefinite integral of its general 
derivative. 

Proof. By Theorem 14.6, Chap. I, we have 0=6+0 where 6 
is a singular additive function of a set and 0 is the indefinite integral 
of a summable function f. Hence, making use of Theorem 6.3 and 
of Lemma 7.1 we find almost everywhere D^(x)=D0(a;)+D?P'(a;)=/(x) 
and this proves the theorem. 

We can extend the theorem to additive functions of bounded 
variation of an interval. We have in fact: 

(7.4) Theorem. If F is an additive function of bounded variation 
of an interval, the derivative F' is summable, and the function F is 
the sum of a singular additive function of an interval and of the in- 
definite integral of the derivative F'. 

Moreover, if the function F is non-negative, we have for every 
interval /„ 

(7.5) F(I 0 )^fF'(x)dx, 

h 

equality holding only in the case hi which the junction F is absolutely 
continuous on I 0 . 
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Proof. We may clearly assume the function F to be non- 
negative in both parts of the theorem. The corresponding function 
of a set F*, together with its function of singularities, will then also 
be non-negative; and on account of Theorems 7.3 and 2.1 we shall have 

(7.0) F*(X)p* fl)F*(x)dx — f F'(x)dx for every bounded set A’ e ©. 

x x 

Let us write for any interval 1 

(7.7) T(l) = F(l) — f F'( jc) dx. 

i 

The function of an interval thus defined is clearly non-negative, 
since by (7.0) 

F(I)^F*(1 C )^ [f'(x) dx = f F'(x) dx 
/' D t 

for every interval I. Moreover, if we take the derivative of both 
sides of ( 7. 7 ) in accordance with Theorem 6.3, we find D2’*(/) T'(x)— 0 

almost everywhere. It therefore follows from Theorem 7.3 that the 
function of a set T* — and so by Theorem 12.6, Chap. Ill, the 
function of an interval T — are singular. The relation (7.7) therefore 
provides the required decomposition for the. function F. 

Finally, since the function T is non-negative, it follows from 
(7.7) that the inequality (7.5) holds for every interval I 0 , and re- 
duces to an equality if, and only if, T(l) — 0 for every interval 
ICI 0 - other words, in order that there be equality in (7.5) it 
is necessary and sufficient that the function F be on /„ the inde- 
finite integral of its derivative, i. e. be absolutely continuous on /„. 

Theorem 7.4 provides a decomposition of an additive function 
of bounded variation of an interval into two additive functions 
one of which is absolutely continuous and the other singular. Just 
as for functions of a set, this decomposition is termed Lebesgue 
decomposition and is uniquely determined for any additive function 
of bounded variation. For suppose that (7,4 Tj=(j 2 -t- T 2 where (7 1 
and G 2 are absolutely continuous functions and T 1 and T 2 are sin- 
gular functions; then G 2 — G l =T 1 — T 2 and by Theorem 12.1 (2°, 6°), 
Chap. Ill, this requires G t = G 2 and T,= T 2 . The absolutely con- 
tinuous function and the singular function occurring in the Lebesgue 
decomposition of a function of bounded variation F are called, 
respectively, the absolutely continuous part and the function of sin- 
gularities of the function F. 
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As a special case of Theorems 7.3 and 7.4, let us mention the 
following result, in which part 2° includes Lemma 7.1 and its 
converse. 

(7.8) Theorem . 1° An additive junction of a set , or an additive function 
of an interval of bounded variation , is absolutely continuous , //, and 
only if, it is the indefinite integral of its derivative . 

2° An additive function of a set , or an additive function of an 
interval of bounded variation , is singular if, and only if, its derivative 
vanishes almost everywhere . 

Finally, let us mention also an almost immediate consequence 
of Theorems 7.3 and 7.4: 

(7.9) Theorem . The derivative of the absolute variation of an ad- 
ditive function of a set, or of an additive function of an interval of 
bounded variation , is almost everywhere equal to the absolute value 
of the derivative of the given function . 

Proof. Consider to fix the ideas, an additive function of bound- 
ed variation of an interval, F. Let T be the function of singular- 
ities of F, and let W and V be the absolute variations of the func- 
tions F and T respectively. In virtue of Theorem 7.1 the relation 

F(I) = I F'{x)dx + T(I) holds for every interval I, and hence also 
/ 

(7.10) W(I)^[\F'(x)\dx + V(I). 

i 

Now the function V is singular together with T, so that its deriva- 
tive vanishes almost everywhere by Lemma 7.1. Hence taking the 
derivative of (7.10), we find on account of Theorem 6.3 that 
W'(a)^ |P'(oj)| almost everywhere, and this completes the proof 
since the opposite inequality is obvious. 

§ 8. Rectifiable curves. By a curve in a space ll,„ we shall 
mean any system C of to equations x,=X,{t) where i=l,2, ... ,m and 
the Xj(t) are arbitrary finite functions defined on a linear interval 
or on the whole straight line lt v The variable t will be termed para- 
meter of the curve. The point (X \(t) } X t (t ), ... , X m (t)) will be called 
point of the curve corresponding to the value t of the parameter, 
and denoted by p(C; t). If E is a set in R v the set of the points 
p (C]t) for teE will be called graph of the curve C on E and de- 
noted by B(C f ; E) (cf. the similar notation for graphs of functions, 
Chap. Ill, p. 88). 



122 


CHAPTER IV. Derivation of additive functions. 


For simplicity of wording we consider in the rest of this §, 
only curves in the plane i? 2 ; we shall suppose also that the functions 
determining these curves are defined in the whole straight line B v 
But needless to say, these restrictions are not essential for the 
validity of the proofs that follow. 

Let therefore C be a curve in the plane, defined by the equa- 
tions x=X(t), y=Y(t). Given any two points a and &> a, a finite 

sequence r = {< y } ;=0 ,i n of points such that a = t 0 ^.t 1 ^...^t„=b 

will be called chain between the points a and b, and the number 

n 

(Pj- i , Pi), where Pj=v(C; tj), 

;--i 

will be denoted by o(C ; i). (We may regard this number as the 
length of the polygon inscribed in the curve C and whose vertices 
correspond to the values a=t 0 ,t v ...,t n =b of the parameter.) 
The upper bound of the numbers o(C)i) when t is any chain 
between two fixed points a and b, will be called length of arc of the 
curve C on the interval I— [a, b], and will be denoted by S(C; Z) 
or S(C; a, b). If S(C; Z) =f = 00 the curve C is said to be rectifiable on 
the interval Z; and if this is the case on every interval we say sim- 
ply that the curve C is rectifiable. 

(8.1) For any curve C we have S(C; a, &)+S(C; b, c)=S(C; a, c) 
whenever «<6<c. 

It is enough to prove that S(C; a, 6)+S(C; b, c)^S(C; a, c), 
since the opposite inequality is obvious. Let T={a=t 0 ,t 11 ... ,t„ = c} 
be any chain between a and c, and let h be the index for which 
ti,~i^b<t h . Writing T l ={a=t 0 ,t l , ... ,t h - U b) and T t ={b,t hl ... ,f„=c), 
we have o(C; iOO((7; r 1 )-fa(G; t 2 )^S(C; a, &)-{- S(C; b, c), and so 
S(C; a, c)^S(C; a, ft) + S(C; b, c). 

It follows from (8.1) that if a curve C is rectifiable, the length 
of arc S(C; I) is an additive function of the interval Z. We shall 
call this function length of the curve C. Any function of a real vari- 
able that corresponds to this function of an interval, i. e. any 
function S(t) such that S{b) — S(a)=S(C; a, b) for every interval 
[a, b], will also be termed length of the curve C. 
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(8.2) Jordan’s Theorem. If C is a curve given by the equations 
x—X(t),y = Y(t), we have 

W(Z; J)<S(C; I), W (Y; i)<S(C; 7), 

' ) W(Z;7)+W(Y;Z)^S(C;Z) 

/or any interval 7; and therefore, in order that the curve C be rectifiable 
on an interval I 0 , it is necessary and sufficient that the functions X 
and Y be of bounded variation on Z 0 . 

Proof. Given an interval 7 = [a, b], we easily find that for 
any chain i={a=t 0 ,t 1 ,...,t n =b) between the points a and b, 

i|Z« y )-Z«y- 1 )|<o(C; *), i|Y(f y )-r(M|<o(0;t), 

y= i y--< 

a(0; T)<i|Z(f y )-Z(t y „Ol + iV(«y)-y(^i)l<W(X; Z)+W(Y;Z), 

- y-i y- 1 

from which the inequalities (8.3) follow at once. 

(8.4) Theorem. If V is a rectifiable curve given by the equations 
x—X(t),y—Y(t), and the function S(t) is its length, then 

(i) in order that S(t) be continuous at a point [ absolutely contin- 
uous on an interval J it is necessary and sufficient that the two func- 
tions Z(f) and Y(t) shoxdd both be so-, 

(ii) we have il(B(C’; E)) <[|$[i?]| for any linear set E (i.e. on 
any set E the length of the graph of the curve C does not exceed the 
measure of the set of values taken by the function S(t) for teE)-, 

(iii) [S'(t)f=[X'(t)f+[Y'(t)f for almost every t; 

b 

(iv) S(C; a, b) > fV[X'(tjf+[ F(7)] 2 dt for every interval [a, b], 

a 

and the sign of equality holds if, and only if, both the functions Z(t) 
and Y (t) are absolutely continuous on [a, b ]. 

Proof, re (i): This part of the theorem is an immediate con- 
sequence of the relations (8.3), since by Theorems 4.8 and 12.1 (1°) 
of Chap. Ill, a function of bounded variation is continuous at a point 
[absolutely continuous on an interval] if, and only if, its absolute 
variation is so. 
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re( ii): Let e be any positive number and let jin} be a sequence 
of intervals such that 

(8.5) 8[E]CZl« »nd (8.6) |S[^]| + *. 

n n 

We may clearly assume that no interval I„ has a diameter exceeding t. 
This being so, we write L’=B(C; E) and we denote by E n 
the set of the points p(C;t) of the curve C for each of which S(t)el n . 
It is easy to see, on account of (8.5), that EQ^E n . On the other 

n 

hand d(E„)^.d (/„)— |Z„|^e for any », and it follows from (8.6) 
(of. Chap. II, $8) that A l,) (E)^ |<S[JSJ| + f, and hence by makings 
tend to zero, that A{E) |$[I?]|, as asserted. 

/r(iii): Let / 0 =[«<»M * )e an y interval. We shall show suc- 
cessively that both the relations 

(8.7) [ 4 r«)] 2 >[.r (*)]*+[ r«)3 2 and (8.8) [S'(f)] 2 <[X'(t)] 2 +[r«)P 

hold almost everywhere in Z 0 ; here the derivatives S'(t), X'(t) 
and Y'(t) exist almost, everywhere on account of Theorem 8.2 and 
of Lebesgue’s Theorem 5.4. 

We have S(t + *)—«(<» [[X(t + h)-X(t)f + [ Y(t+h)-Y(t)f}'* 
for any point t and any h > 0, and if we divide both sides by h and 
make h -> 0, this implies the relation (8.7) at any point t for which all 
three functions are derivable at the same time, i. e. almost everywhere. 

Now let ^1 denote the set of the points tel 0 at which the three 
derivatives X'{t), Y'(t) and S’(t) exist without satisfying the inequal- 
ity (8.8); and for any positive integer n, let A„ denote the set of 
the points 1 1 .4 for each of which the inequality 

S(I)/\1\ > {[X(I)l\I\f + [ Y(I)l\I\f} ii2 + 1 In 

holds for all the intervals I containing t, whose diameters are less 
than 1/tt. Clearly A=^A n . 

n 

Keeping n fixed for the moment, let e be any positive number. 
There exists a chain r— (o 0 = t 0 , f„ ... , t p = b 0 ) such that <* — f*-i <1 /» 
for h= 1,2, ...,p, and such that S(C;I 0 )^<J(C;t)+f. Consequently, 
writing for brevity </*=[«*_„ <J, p„= p(C;<*), ( , *=C(P*_p P*)» we have 

(8.9) 8(Ji,)'^ei,+\Ji>\l n for 1e= l,2,...,p, whenever J k -A n 4=0, 

and on the other hand 

1 8 <J*)< 

A»1 km 1 


( 8 . 10 ) 
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Therefore if stands for a summation over ail the indices k 

k 

for which Jk-An + 0, we find on account of (8.9) and (8.10) that 

Kl <2? n) \J k Kn.Z? n \S(J' k )-Q k ] <n-Z[S(J k ) — <g < 

k k A— 1 

Now since e is an arbitrary positive number, it follows that 
|JL w | = 0 for any n, and therefore also that \A\ = 0. Thus the in- 
equality (8.8) holds almost everywhere, as well as the inequality (8.7), 
and this completes the proof of part (iii) of the theorem. 

Finally, since S (C) a, b)=S(b) — $(a), part (iv) reduces on 
account of (i) and (iii) to an immediate consequence of Theorem 7.4. 

Theorem 8.4 (in particular its parts (iii) and (iv)) is due to L. Ton ell i 
[1; 4]; cf. also F. Riesz [6; 7]. 

As regards part (ii) of the theorem, it may he observed that in t lie case 
in which the curve C has no multiple points (i. e. when every point of the curve 
corresponds to a single value of the parameter) the inequality 4jB(C; E)\ |8[£]| 

can easily be shown to reduce to an equality. 

As proved by T. Wazewski [Ij, any bounded continuum Pot finite 
length may be regarded as the set of the points of a curve C on the in- 
terval [0, 1], such that S(0;0,1) 2*d(P). 

§ 9. De la Vall6e Poussin’s theorem. With the help of 
the results established in the preceding § we can complete further, 
for continuous functions of bounded variation of a real variable, 
the decomposition formula of Lebesgue. 

We shall begin with the following theorem, which itself com- 
pletes, in part at any rate, the second half of Lebesgue’s Theorem 5.4. 

(9.1) Theorem. If F(x) is a function of bounded variation and W (jr) 
denotes its absolute variation , then for the set N of the points at which 
the function F(x) is continuous but has no derivative finite or infinite , 
we have 

(9.2) | F *( N )| = W*{N) |N| = 0 and (9.3) A{b(F; N ) ) = (>. 

Proof. Consider the curve C : x= x, y—F(x). Let S(x) be 
the length of this curve and let E be the set of the values assumed 
by the function S(x). For any denote by X(s) the value of x 
for which S(x)=s and write Y(s)=F(X(s)). Since (cf. Theorem 8.2) 

| A" (8 % ) — A(*i)|^|* t — s t | and | r(# 2 ) — r(# l )|^|* 1 — s x \ for any pair 
of points * lt $2 of E, the functions X(s) and 3f(#) are continuous 
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on the set E, and moreover may be continued on to the closure E 
of this set by continuity. If now [a, 6] is an interval contiguous 
to E, the function X($) assumes equal values at the ends of this 
interval, and the point x=X(a)—X(b) is a point of discontinuity 
of the function F(x). We shall complete further the definition of 
the functions X{s) and Y (s) on the whole straight line if, so as 
to make the former constant and the second linear, on each interval 
contiguous to E. 

This being so, consider the curve C x given by the equations 
x=X(s), y=Y(s). We verify easily that the parameter s of this 
curve is its length. (Actually we see easily that the graph of the 
curve C, is derived from that of the curve C by adding to the latter 
at most an enumerable infinity of segments situated on the lines 
x—Cj where c, are the points of discontinuity of the function F.) 
By Theorem 8.4 (iii), we therefore have [X'(s)] 2 + [r'(s)f=l for 
almost all s, and therefore the set H of the points s for which either 
one of the derivatives X'{s) and Y'(s) does not exist, or both exist 
and vanish, is of measure zero. 

Now we see at once that if seE — H , then at the point x=X(s), 
the derivative F'(x) exists (with the value Y'(s)IX'(s) if X'(s)=}=0, 
or with the value ±oo if X'(s)=0 and Y'(s)^O). Therefore 
iVC-Y[JE-5], or what amounts to the same, S[N]CLE-H, and 
hence < \H\ = 0. 

From this we derive at once with the help of Theorem 8.4 (ii) 
the relation (9.3), since B(F; N)=B(C-, N). Finally, the function 
8(x ) is continuous (cf. Theorem 8.4 (i)) at any point at which the 
function F(x) is continuous, and so at any point of the set N, and 
therefore it follows from Theorem 13.3, Chap. Ill, and from Theo- 
rem 8.2, that |F*(N)|^W*(A t )^<S'*(A)=|j 8[A]| = 0; this completes 
the proof. 

(9.4) Lemma. If F(x) is a function of bounded variation , then 

(i) F*{A)^k-\A\ for any bounded set A and any finite number k 
whenever the inequality F'(x)'^k holds at every point x of A (and 
the assertion obtained by changing the direction of both inequalities 
is then evidently also true), 

(ii) F*(B)= 0 for any bounded set B of measure zero throughout 
which the derivative F'(x) exists and is finite. 
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Proof, re (i). Let e be any positive number. Denote, for any 
positive integer », by A w the set of the points xeA such that 
E(I)^(Jc — e )-\I\ holds whenever I is an interval containing x and 
of diameter less than 1 /». Clearly A — lim J. <n) . 

n 

Keeping the index n fixed for the moment, let us denote by <? <n> 
a bounded open set containing A (n \ such that (cf. Theorem 6.9, 
Chap. Ill) 

(9.5) |P*(X)— F*(A <n) )|<£ whenever A {n) C^CO { "\ 

and let us represent 6 {n) as the sum of a sequence 1 , 2 , ... of non- 

overlapping intervals. We may clearly suppose that all the inter- 
vals Jp"* are of diameter less than 1 /» and that their extremities 
are not points of discontinuity of -F(o5). So that if ]>? n) stands for 

P 

summation over the indices p for which lj, n) . _4. (n) 4=0, and X {n) 
denotes the sum of the intervals I™ corresponding to these indices, 
we find F*(XW)=£ n> F(I™)^(lc— t)-£ n) \C\^(k — t)-\A w \, and 

P P 

hence on account of (9.5), F*(A {n) )^>(Jc — «)-|A (n) | — f. Making e-> 0 
and n->oo, we obtain in the limit F*(A) ^ k- A\. 

re(ii). Let B {n) denote the set of the points xeB for which 
F'(x)'^‘ — n. By (i) we have F*(B {n) )P = — n |B (n) | = 0 for any positive 
integer n, and so F*(B)^ 0. By symmetry we must have also 
0, and therefore F*(B)= 0. 

(9.6) De la Valine Poussin’s Decomposition Theorem. If F(x) 
is a function of bounded variation with W (x) for its absolute varia- 
tion, and if E +oa and E - « denote the sets in which F(x) has a deriv- 
ative equal to -f 00 and to — 00 respectively , then 

(i) for any bounded set X measurably (5B) at each point of which 
the function F(x) is continuous, we have the relations: 

(9.7) F*(X)=F*(X-E +co )+F*(X E »)+ J F’(x) dx, 

X 

(9.8) W*(X)=F*(X-E +00 ) + \F*(X-E~ 0a )\ + [\F'(x)\dx; 

x 

(ii) the two derivatives F‘(x) and W'(x) exist and fulfil the rela- 
tion W'(®)=|f ,, (x)| at any point x of continuity of F, except at 
most at the points of a set N such that W*(A r )=|J7| = 0 (i.e. a set 
which is at the same time of measure (L) zero and of measure (W) zero). 
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Proof. re(i). On account of Theorem 7.3 there exists a set A 
of measure zero such that for any set A' bounded and measurable ('8) 

(9.9) F*(X)—F*(X-A) + [f'(jc) dx. 

x 

Supposing further that the function F is continuous at every 
point of A', we may assume, in virtue of Theorem 9.1, that the function 
F(x) has everywhere in A, a derivative, finite or infinite. Moreover 
since by Lemma 9.4 (ii) the function F*( X) vanishes for any bounded 
subset of A — (E+oa+E -«>), we may assume that 
Finally, we obtain directly from (9.9) that F*( X) vanishes when 
A' CCA and |A'| = 0. We may therefore choose simply A—E ¥oo -\-E » 
and formula (9.9) becomes (9.7). 

Formula (9.8) is, by Theorem 13.2, Chap. Ill, an immediate 
consequence of (9.7), since Lemma 9.4 (i) shows that F*(X-E +oo )~^>0 
and F*(X-E ooJ-^O for every set A' bounded and measurable (8). 

re(ii). Let N be the set of the points x at which the function 
F(x) is continuous and either one at least of the derivatives F'(,r) 
and W'(jr) does not exist, or both exist but do not satisfy the re- 
lation W'(.r)=!F'(j")'- We then have N-E +00 = A'-L’_ 00 = 0, since 
evidently W'(j?) = + oo=|F'(.r)| at any point where F'(x)=±oo. 
Therefore, since the set X is further, by Theorem 7.9, of measure (L) 
zero, it follows from formula (9.8) that W*(A T ) = 0 and this com- 
pletes the proof. 

Let us mention an immediate consequence of Theorem 9.6. In order that 
a continuous function of bounded variation F (x) be absolutely continuous , it is neces- 
sartf and sufficient that the function of a set F*(X) should vanish identically on the 
set of the points at which F(x) has an infinite derivative. In particular therefore . 
any continuous function of bounded variation which is not absolutely continuous 
has an infinite derivative on a non -enumerable set. 

Let us remark further that the theorems of this § cannot be extended directly 
to additive functions of an interval in the plane. Thus if F(I) denotes the con- 
tinuous singular function of an interval in /f 2 , which for any interval / equals 
the length of the segment of the line x— y contained in /, we have F(x) 0 
for every point j\ so that F'( jr)^oo does not hold at any point. 

jj 10* Points of density lor a set. Given a set E in a space 
li m , the strong upper and lower derivates of the measure-function of E 
(cf. § 6, p. 117) at a point x will be called respectively the outer upper 
and outer lower density of E at x. The points at which these two den- 
sities are equal to 1 are termed points of outer density , and the points 
at which they are equal to 0 , points of dispersion , for the set E. 
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If the set E is measurable we suppress the word “outer” in these 
expressions. We see further, that if the set E is measurable, any 
point of density for E is a point of dispersion for C E, and vice-versa. 

We shall show in this § (cf. below Theorem 10.2) that almost all points 
of any set E are points of outer density for E, or what amounts to the same that 

(10.1) For almost all points x of E , if |/ n [ is any sequence of intervals tendinq 
to x (in the sense of § 2, p. 106), we have \E-I n \/\I n \-+l. 

This proposition presents an obvious analogy to Theorem 6.1 and it is 
in the form (6.1) that the “density theorem** is often stated and proved either 
with the help of Vitali*s Covering Theorem or by more or less equivalent means 
(vide, for instance, E. W. Hobson [I], Ch. J. de la Vallee Poussin [I, p. 71J 
and W. Sierpinski [101). Theorem 10.2 will be however, so to speak, independent 
of Vitali’s theorem, because the sequences of intervals occurring in (10.1) are 
not supposed regular. 

Also, Theorem 10.2 will be more precise than Theorem 6.1, because at 
any point x of outer density for a set E we have a fortiori DL E (x)=l, and in- 
deed, at any x the relations \^ E (x)~\ and Y>\j e (x) — 1 are equivalent. To see 
this it is enough to show that the former of these relations implies the latter, 
the converse being obvious from Theorem 2.1. We may assume moreover, on 
account of Theorem 6.7, Chap. Ill, that the set E is measurable. 

Let therefore x 0 .be a point such that L E (x 0 ) — 1 and let (A n J be a regular 
sequence of measurable sets tending to x 0 . Then there exists a sequence {J n \ of 
cubes such that X n C.J n and |AJ/|JJ>a, where n — 1, 2, ... and a is a fixed 
positive number. Since by hypothesis, \E-J n \f\J„\ — >1, and so \CE»J n \/\J n \—*0, 
it follows that \CE-X n \/\X n |->0, or what amounts to the same that | E- X n \/\X n \ — > 1 . 
Therefore DL fc .(a; 0 ) = l. 

It is, of course, only for spaces H m of dimension number m ^ 2 that Theo- 
rem 10.2 will differ from Theorem 6.1. The two statements are equivalent for H v 

For the various proofs of Theorem 10.2 vide F. Riesz [8J and H. B use- 
man n and W. Feller [1]. In the second of these memoirs will be found a 
general discussion of the different forms of “density theorems”. 

It is of interest to observe that the proposition (10.1) ceases to be true 
even for closed sets E, if the intervals I n are replaced by arbitrary rectangles 
with sides not necessarily parallel to the axes of coordinates. This remarkable 
fact has been established by O. Nikodym and A. Zygrnund ( vide O. Nikodyin 
[1, p. 167]) and by II. Busemann and W. Feller [1, p. 243]. 

(10.2) Density Theorem . Almost all the points of an arbitrary set E 
are points of outer density for E ; and if further the set E is measurable , 
almost all the points of C E are points of dispersion for E . 

Proof. For simplicity of notation, we shall state the proof 
for sets which lie in the plane; the corresponding discussion in any 
space i£„, is however essentially the same (except that in lt xi as 
already remarked, the theorem reduces to Theorem 6.1). 
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By Theorem 6.7, Chap. Ill, any set can be enclosed in a meas- 
urable set having the same measure-function. On the other hand 
any measurable set is the sum of a set of zero measure and of a 
sequence of bounded closed sets. We may therefore suppose that 
the set considered is bounded and closed. 

Let e be any positive number. We shall begin by defining 
a positive number a and a closed subset A of E such that | E — A\<e 
and that, for any point (£, »?) in the plane, 


(i) 


|E[(®, rj) e E-, a-^x^b]\ ^ (1 — e)(b — a) 

X 

whenever (£,r])eA, a^.£*S£.b and b — a<^o. 


To do this, let us write for brevity, when Q is any set in the 
plane, Q [r>] ='E l [(x, rj) e Q], and let us denote, for any positive 

X 

integer n, by A„ the set of the points {£, tj) of E such that 
— e)-\I\ whenever 7 is a linear interval containing £ 
and of diameter less than 1/n. The sequence {A,}, is evidently 
ascending. Let us write 

(10.3) N—E — lim^l,,. 

n 

For any 17, if £ is a point of the set W 1 ’' 1 , there will then exist, 
for each positive integer n, a linear interval 1 such that £ e 7, 
6(1) < 1/n and ITV'*’ 71 -7|^|jE 7 ,v, -71 <(1 — e)-|7|. Therefore the lower 
derivate of the measure-function for the linear set 77 ,,?l cannot 
exceed 1 — e at any point of this set, whence by Theorem 6.1, 

(10.4) |77 ,,?I |=|E[(*, Tj)e W]|=0 for every real number *7. 

X 

Let us now remark that the sets A n are closed. For, keeping 
for a moment an index n fixed, let (£0, r] 0 ) be the limit of a sequence 
{(£*>’?*)}*= 1,2 points of A„. Let 7 0 be a linear interval such 

that £ 0 el 0 and d(7 0 ) < 1 jn, and let 7 be any linear interval con- 
taining 7 0 in its interior, whose diameter is less than 1/w. Then for any 
sufficiently large k, £ k 1 1 and therefore |E^'*^-7|^(1 — f)-|7|. On the 
other hand, the set E being closed, we easily see that 
-E^DlimsupE 1 ''* 3 ; so that \E [, ‘^-1^ lim sup \E U ^-1\^(1— 1)-|7|, 

At k 

and therefore also l 7? , ’ ? " , -7 0 ^ (1 — c)- 17 0 |. Hence (r 0 , r l0 ) e A„, i. e. 
each set A„ is closed. 

It follows, according to ( 10 . 3 ), that the set N is measurable. 
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Therefore, applying Fubini’s theorem in the form (8.6), Chap. Ill, 
we conclude, on account of (10.4), that the plane set N is of meas- 
ure zero. Consequently | E — A „ J < e for a sufficiently large index n # , 
and writing a=l/n 0 and A =A„ V we find that the inequality \E — A\<_ e 
and condition (i) are both satisfied. 

In exactly the same way, but replacing the set E by A and 
interchanging the role of the coordinates x and y, we determine 
now a positive number < a and a closed subset B of A such that 
\A — J?|<e and that, for any point (£, tj) in the plane 


(n) y 

whenever (g,t])eB, a*^.t]<^b and b — a < a v 


This being so, let (£ 0 , rj 0 ) be any point of B. Let J =[a„ a 2 , /? 2 ] 

denote any interval such that (£ 0 , r lo) f J and d(J)<o 1 <o. By Fubini’s 
theorem (in the form (8.6), Chap. Ill) we have 


(10.5) 



Since (£ 0 ,t] 0 )eB and a 2 s^ »? 0 ^/S 2 , it follows from (ii) that the 
set of the y such that (£ 0 ,y)eA and a 2 <;y<; /J a is of measure at 
least equal to (1 — f) (/3 2 — a 2 ). On the other hand, since 
it follows from (i) that |E[(a>, y)e E] a 1 ^a?^|S 1 ][^(l — £)(& — o 2 ) 

or 

whenever (£ 0 , y)eA. Hence, formula (10.5) gives 


\E-J\ > (1 — 0* (Pi- «i) (/»,-«,)=(!— «)*• ! J\, 

i. e. the lower density of E is at least equal to (1-— f) 2 at any 
point (£ 0 , Vo) °f B. Therefore, since j E — B\^\E — A\ + \A — jB|^2c 
and since e is an arbitrary positive number, the lower density of E 
is exactly equal to 1 at almost all the points of the set E, i. e. 
almost all points of this set are points of density for it. 

The second part of the theorem is an immediate consequence 
of the first part. In fact, if the set E is measurable, so is the set C E, 
and almost all points of C E, since they are points of density for C E t 
are points of dispersion for E. 

In connection with the definition of points of density, Denjoy 
introduced the important notion of approximate continuity of 
a function. We call a function of a point f(x) (in any space Jt m ), 
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approximately continuous at a point x 0 , if f(x 0 ) 00 and f(x)->-f(x 0 ) 

as x tends to x 0 on a measurable set E for which x 0 is a point of 
density. 

(10.6) Theorem. If f is a measurable function almost everywhere 
finite on a set E, then the function f is approximately continuous at 
almost all points of E. 

Proof. On account of Lusin’s theorem (Chap. Ill, §7), given 
any e > 0, we can represent the set E as the sum of a closed set F 
on which the function / is continuous and of a set of measure less 
than 1 . The function / is clearly approximately continuous at any 
point of density for the set F, and so, by Theorem 10.2, at almost 
all points of F. This implies, as f is an arbitrary positive number, 
that the function / is aproximately continuous at almost all points 
of the set E itself. 

Theorem 10.6 is due to A. Dcnjoy [5] (cf. also W. Sierpiriski [6; 0 ]). 
It is easy to see that the converse holds also, i. e. that every function yt'hich is 
approximately continuous at almost all the points of a measurable set E is measurable 
on E ( vide W. Stepanoff [2] and E. Kamke [1]). 

Let us mention also the following theorem, an almost im- 
mediate consequence of Theorem 10.6, which completes, in part, 
Theorem 2.2 on derivation of an indefinite integral: 

(10.7) Theorem. If <D is the indefinite integral of a bounded meas- 
urable function /, then <P~,(x)—f(x) at almost all points x, and in 
fact, at any point x at which the function f is approximately continuous. 

Proof. Let x 0 be a point at which the function f is approx- 
imately continuous and let E be a measurable set for which x 0 is 
a point of density, while f{x)-> f{x 0 ) as x tends to x 0 on the set E. 
We may suppose (by subtracting, if necessary, a constant from 
f(x)) that /(.c 0 )= 0. Therefore, given any positive number f, we 
have for any interval I of sufficiently small diameter con- 
taining x 0 , (i) I CE 1 and (ii) f(x) <£ for every xel-E. 
Denoting by ilf the upper bound of \f(x)\, conditions (i) and (ii) 
imply |0(/j| < \0(I-CE)\ + ,0{I-E)\ < Me- |/|+e-|/| < £-(Jf+l)-|I|, 
whence &\(x 0 )—0=f(x 0 ). 

If if i« the indefinite integral of a function / which is suinmable but un- 
bounded, it may happen that the relation <P*{x)~f(x) is not fulfilled at any point. 
In virtue of Theorem 6.3, this relation clearly holds at almost all points at 
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which the strong derivative &'s(x) exists; but if the function / is unbounded, its 
indefinite integral 0 may have no finite strong derivative at any point (cf. on 
this point Busemann and Feller f 1 , p. 256]; the result of Banach and Bohr 
mentioned above, p. 112, follows as a particular case). 

Nevertheless, the result contained in Theorem 10.7 may be generalized 
considerably. In fact, according to a theorem of B. Jessen, J. Marcinkie- 
wicz and A. Zygmund [1] (see below § 13) the indefmite integral of a function f 
in a space R m is almost everywhere derivable in the strong sense whenever the function 
|/l*(l°g + l/|) m ~ 1 w summable (in a less general form, for functions / of which the 
power p > 1 is summable, this theorem was established a little earlier by 
A. Zygmund [1]). On the other hand however, given an arbitrary function o(t ) 
positive for t> 0 and such that lim inf o(t)~ 0, there always exists a function f(x) 

in R m such that the function ^(|/|H/|*(log + |/|) ,n 1 is summable and such that the 
indefinite integral of f is not derivable in the strong sense (has the strong upper 
derivate +oo) at any point of R m . 

* § 11. Ward’s theorems on derivation of additive func- 
tions of an interval. In the preceding §§ of this Chapter, we 
have treated the Lebesgue theory of derivation of additive func- 
tions of an interval of bounded variation. As regards functions 
of a real variable, this theory has been extended to arbitrary func- 
tions by Montel, Lusin and especially by Denjoy. Recently Denjoy’s 
theorems, which already belong to the classical results of the theory, 
have been generalized still further. On the one hand they have been 
given a geometrical form by which they become theorems on certain 
metrical properties of sets, and in this form an account will be given 
of them in Chapter IX. On the other hand, recent researches of 
Besicovitch and Ward have made it possible to extend an essential 
part of the Denjoy results, particularly the relations between the 
extreme bilateral derivates, to additive functions of an interval 
in a space R m of any number of dimensions. These researches will 
form the subject of the present §. 

It was A. S. Besicovitch [5] who started these researches, by establishing 
between the extreme strong and ordinary derivates of absolutely continuous 
functions of an interval, relations analogous to those proved by Denjoy for 
derivates of functions of a single variable. A.J.Ward[2;5] has extended this 
result to quite arbitrary additive functions of an interval. Of the two theorems 
of Ward (vide, below, Theorems 11.15 and 11.21) one concerns only ordinary 
derivates, while the other applies also to strong derivates. It is the latter that 
generalizes the result of Besicovitch; this second theorem is one which can be 
proved fairly simply for functions of an interval in the plane; it is rather curious 
that it requires much more delicate methods in an arbitrary space R m . 



134 


CHAPTER IV. Derivation of additive functions. 


We shall make use in this § of some auxiliary notations. If F 
is an additive function of an interval and a 1 is a positive number, 
F(„)(x) and F w (x) will denote at any point x the upper and lower 
limit of the ratio F(I)/\I\ where I is any interval containing x, 
which is subject to the condition r (I)^a, and which has diameter 
tending to zero. We see at once that at any point x , F (n) (x) and F (a) (x) 
tend to F(x) and F(x) respectively as a -> 0. 

We shall suppose fixed a Euclidean space R m , and in it we 
define a regular sequence of nets of cubes {Qa)a-i, 2 ,..., denoting by 
{Q*! the family of all the cubes of the form 

[?, 2 *, (p,+l) 2 p 2 2 - *, (p 2 +l) 2“*; ... ; p m 2~ k , (p m +l) 2~*] 

where p v p 2 , ..., p m are arbitrary integers. 

(11.1) Jjfmma . Given an additive function of an interval G, and 

positive numbers a and a , suppose that the inequalities 0 <(?(«)(#) <a 

hold at every point x of a set E having positive outer measure ; then 
there exists for each e > 0, a cube Q which belongs to one of the nets Q * 
and for which we have 

(11.2) 6(Q) < f, \E-Q\>(1—b)-\Q\ and G(Q)<8 m -a~ m .a-\Q\. 

Proof. By replacing, if necessary, the set F by a suitable 
subset of E having positive outer measure, we may suppose that 
there exists a positive number a such that for every interval I, 

(11.3) (?(/)> 0 whenever I-E^O, r (I)^a and S(I) <u. 

We may further clearly assume tnat a is less than both e and a'"/8 m . 

This being so, let x 0 eE be a point of outer density for the 
set E (cf. § 10). Since G (a \(x 0 ) < a, we can determine an interval 
«/=[«), br, a m , b m ] containing x Q and such that 

(11.4) 6 (J)<o, r(J)^a, \E-J\>(1 — a 2 )-|J| and G(J)<a-\J\. 

It follows in particular that 

(11.5) |E-Z|>(1 — a)* JZ| for any interval IC_J such that 

Let l be the smallest of the edges of J. Since r (J)^a, no edge 
of J can exceed i/a, and therefore we have Jj ^ l m ja m . Finally 
let k be the positive integer given by 
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(11.6) 1/2* < f/4 < 1/2* -1 , 

and let Q=[a\, b\\ a m , b m ] be a cube which belongs to the net Q* 
and which contains the centre of the interval J. By (11.6) we find 
that Q(ZJ and that a \ — a t ^1/4, bt — b't^lj 4 and b\ — a)=l/2*^ Z/8. 
It follows easily that the figure J QQ can be subdivided into a finite 
number of non-overlapping intervals with no edge smaller than i/8. 

Now any such interval can clearly be further subdivided into 
a finite number of non-overlapping subintervals whose edges all lie 
between i/8 and i/4. We thus obtain a subdivision of the figure 
JQQ into a finite number of non-overlapping intervals, whose para- 
meters of regularity are greater than, or equal to, 2~ m ^a, and whose 
volumes are greater than, or equal to, a, m -\J\> a -\J\. It 

therefore follows from (11.5) and (11.3) that 

(11.7) 0(JQQ)> 0. 

Similarly, it follows from (11.6) that \Q\=2~ km 8~ m l m ^ 

>8~ m a m -| J\>o-\ J\, whence by (11.8) we derive at once the second of the 
relations (11.2); at the same time, by the relations (11.7) and (11.4), 
(?(#)<<?(J)<a-|J|<;8. m a~ m a-|@|, and this gives the third of the 
relations (11.2) and completes the proof. 

(11.8) Lemma.. Let G be an additive function of an interval in R m , 
E a set in R m , Q a cube belonging to one of the nets Q *, a/nd a > 0, 
e > 0 and b arbitrary fixed numbers. Suppose that 

(i) \E-Q\Xl-em, 

(ii) 0(1) > 0 for every interval 1 such that I(ZQi I E ^0 and 
r(I)^2- m ,_ 

(iii) 0(a)(x) >• b at any point xeE; 

then G(Q)>12~ m -a m b-(l — 2 m t)-\Q\. 

Proof. We may clearly assume that the set E is contained 
in the interior of Q and that every point of the set is a point 
of density. 

This being so, we shall begin by establishing the following 
result: 

111.9) Given any »?>0, we can associate with any point xe E 
‘ cube P, belonging to one of the nets Q*, and a cube JD-P, such 
xt (a) G(P)>a m -12~ m -b-\P\ and (b) xeJ , 6(J)<v and |«7|=3 m -|P|. 
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For this purpose, let us associate with the point x an interval 8 
such that xeS, <5(<S)^l/4, r (S)^a and G(8) > 6-|£|. Let h denote 
the largest edge of S, and let be the positive integer satisfying 
the inequality 1/2*' > 2ft ^ l/2* ,+1 . Let S x be a cube of the net Q*, 
having points in common with 8, and let J denote the cube formed 
by the 3 m cubes of the same net (including the cube 8 1 itself) which 
have points in common with S v 

The cube J clearly contains the interval 8, and since no edge 
of S can be less than ah, we find that 

(11.10) |J|=3 m -2“ m *'=6 m -2~ m( * ,+l) <12 m -A m <12" , -a^ m -|fi'|. 

On the other hand, since 2 * 1 — ft>2 _( * ,+1) and ah^a-2 < *' +2) , the 
figure JQS can be subdivided into a finite number of non-over- 
lapping intervals with edges greater than, or equal to, a-2 2) , and 

therefore, as in the proof of Lemma 11.1, into a finite number of 
non-overlapping intervals whose edges have lengths between a 2 ( * ,+2) 
and a- 2 Therefore, denoting by I any interval of this sub- 

division, we find r (I) ^ 2~ m and \I\^ a m -2~ m{kt+l) — 12 ™ a m • |J|. 
Consequently, by supposing the interval . 8 , and a fortiori the 
cube J , sufficiently small, we may assume that d(J) < r\ and 
that each of the intervals of the subdivision in question contains 
points of E. It follows, by condition (ii) of our lemma, that 
0(JQS)> 0, and so, by (11.10), that G(J)>G(S)>b-\S\^12~ m -a m -b-\J\. 
Thus among the 3 m cubes of the net Q*, which make up the cube J , 
there is one at least, P say, such that G(P) > 12 m -a m -b-\P\ y 
and the cubes P and thus defined, clearly satisfy the con- 

ditions (a) and (b) of (11.9). 

It now follows, on account of (11.9) and condition (i) of the 
lemma, that (with the help of Vitali’s theorem in the form (3.8)) 
we can determine in Q a finite system of non-overlapping intervals 
2*11 2*21 *•* 1 belonging to the nets Q*, such that: 

(11.11) G(Pi)>12~ m -a m b-'P,\ and P,P + 0 for i=l,2,... n, 

(11.12) J|P,-|>(1— e).|<?|. 

<= i 

Among the cubes of the nets Q*, we shall consider specially 
two classes of cubes. A cube of a net Q* contained in Q will be said 
to be of the first class if it is one of the cubes Pi, P 2 , ..., P„; 
and of the second class if it contains points of E and if further 
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among the 2 m cubes (Q*+i) composing it, there exists at least 
one which does not overlap with any cube Pi. Since the number 
of cubes Pt is finite, there exists a net Q*- such that no cube of this 
net contains cubes of the first class. Let 31 be the set of all the cubes 
of the first or second class contained in Q and belonging to the nets 
Q* for k^.K. 

The set of these cubes covers the whole cube Q. For if not, 
there would certainly exist in the net Q k a, cube I 0 (ZQ not con- 
tained in any cube (31). Now, since I 0 contains no cube of 
the first class, I 0 would not contain any point of the set E; and 
since, by hypothesis, 7 0 is not contained in any cube of the first 
or second class, we could, starting with I 0 , form in Q a finite as- 
cending sequence of cubes without points in common with E and 
which belong respectively to the nets Q*-, Ok-\, ■■■ , where 
is the net containing the cube Q. But the last term of this sequence 
of cubes is evidently the cube Q itself, and we arrive at a contra- 
diction since ECQ- 

Let us now remark that since all the cubes (31) belong to the 
nets of the regular sequence {Q*}, it follows that, of any two over- 
lapping cubes (31), one is always contained in the other. Hence, 
we can replace the system of cubes 31 by another system 3l 1 C'2l 
which also covers Q, and which, this time, consists of non-over- 
lapping cubes. Let A be the sum of the cubes (31 t ) of the first class. 
On account of (11.11) we have 

(11.13) 6U)>\2~ m -a m .b-\A\. 

Moreover, since the figure QQA is formed of a finite number 
of cubes of the second class which do not overlap, it follows from 
condition (ii) of the lemma that 

(11.14) O(QQA)^ 0. 

Finally, in each cube I of the second class, there is always 

n 

a cube which is contained in QQ^Pi and whose volume is 2 _m -|I|. 

i=l 

It therefore follows from (11.12) that \QQA\<2 m -t-\Q\, and in virtue 
of (11.14) and (11.13) we find 

0(Q)^0(A)^12~ m a m -b\A\>12^ m -a m -b-(l— 2 m 
which completes the proof. 

(11.15) Theorem. Any additive junction of an interval F_is derivable 
at almost all the points x at which either F(x ) ~> — oo, or F(x) <C + °°. 
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Proof. Consider the set of the points # at which F(x)> — oo 
and suppose, if possible, that the set A of the points x at which 
F(x)'>F(x)'> — oo is of positive measure. We could then determine 
a number a>0, and a set B(2 A of positive outer measure, such 
that F( a )(x) 4= 00 and F(„)(x) — F(«)(x) > a at every point x of B. We 
may clearly assume that a 2~ m . 

Let e be any positive number. Let us denote for any integer p- 
by B p the set of the points x of B at which pe <.F(«.)(x)<^(p-\- 1) e, and 
let p 0 be an integer such that \B P J > 0. We can determine a number 
a>0 and a set Z?C B Pl> , whose measure is not zero, so as to have 
P(/) > p 0 e-|/| whenever the interval / is subject to the conditions 
d(/)<o, r (I)^a and EI^Q. 

Now write 0(1)— F(I) — p 0 e-|Z| (where Z denotes any interval). 
Thus defined, the function 0 clearly fulfils the conditions: 

1° 0 <(?(„)(£) <2e and G^)(x)>a at any point x e E, 

2° (?(!)> 0 for any interval I such that <5(i) < a, r(I)^a 
and E-1 4 = 0. 

By Lemma 11.1 we can therefore determine a cube Q, belonging 
to one of the nets Q*, so as to have d(Q)<o, \E-Q\X1 — fH#| 
and (?(<2)s£^8 m a~' n -2£-|Q|. From the first two of these relations and 
from conditions 1° and 2°, it follows, on account of Lemma 11.8, that 
0(Q) > 12 _m a m+ 1 .(l— 2'" £);#!. Thus 12 m a m+ Ml— 2 m £)<8 m aT m -2e 
for every £ > 0 , and this is clearly impossible. We arrive at a 
contradiction and this shows that |^1|=0, i. e. that F(x)—F(x) for 
almost all x for which F(x) > — 00 . 

It remains to be shown that the set of the points x at which 
the derivative F'(x) is infinite, is of measure zero. Suppose then, 
if possible, that F'(x)=+ 00 at each point a? of a set M of positive 
measure. We may clearly assume that there exists a number 17 >0 
such that F(/)> 0 whenever I is an interval containing points 
of if and subject to the conditions 6(1) <r] and r(Z)^ 2 ~ m . There- 
fore, denoting by R any cube which belongs to one of the nets 
Q * and which satisfies the relations \M-R\ > ( 1 — 2 ~ (m+ 1 ) )-|i?| and 
we find easily from Lemma 11.8 that F(R) > 2 ~ 1 - 12 ~'"-ft-|iJ| 
for every finite number b. We thus again arrive at a contra- 
diction and this completes the proof. 
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It should be remarked that for the validity of Lemma 11.1 it is enough 
to suppose merely that a<\ (instead of a < 2 ~ m ). Similarly in condition (ii) of 
Lemma 11.8, the inequality r(J) ^ 2~ m may be replaced by r (I)^>a. The proofs 
of the lemmas remain essentially the same; we need only observe that if I is an 
interval whose parameter of regularity is greater than, or equal to 2~ n \ and if 
a < 1 is a positive number, the interval I can always be subdivided into a finite 
number of non-overlapping subintervals Zy, where ;= 1,2,..., such that t(Ij) 
and \Ij\^>k a *\I\, where k a is a constant depending only on a. 

We can now easily see that Theorem 11.15 may be stated in a slightly 
more general form as follows: any additive function of an interval F is derivable 
at almost all the points x at which either F^(x) ? — oo or F^(x)< -f-oo, where a 
i 8 any positive number less than 1. The question whether the condition 1 is 
necessary here, does not seem to have been solved yet completely. It may how- 
ever easily be proved (by the method of nets used in the proof of Lemma 11.8) 
that for any additive function F f the set of the points x for which either F^(x) = — oo 
or F^)(x) = -f-oo is of measure zero. For a discussion of these questions, vide the 
memoir of A. J. Ward [5]. 

We shall now proceed to prove the second theorem of Ward, 
in which the ordinary extreme derivates and F(x) of Theorem 1 1.15 
are replaced by the strong derivates F*(x) and F^(x). It should be 
remarked however, that we cannot at the same time replace, in 
the assertion of Theorem 11.15, derivability in the ordinary sense 
by derivability in the strong sense: in fact, in general, a non-negative 
function, even when it is absolutely continuous, may yet have a 
strong upper derivate which is everywhere infinite (see p. 133 above). 

We shall begin by proving the following lemma which is sim- 
ilar to Lemma 11. 1. 

(11.16) j Lemma. If 0 is an additive function of an interval in lt m 
and if for some fixed number a we have 0 < O s (x) < a at every point 
of a set E of positive outer measure , then given any e > 0 there exists 
an interval Q such that 

(11.17) m<e, *(Q) > 2" m , EQ$ 0 and G(Q)<3 m -a-\Q\. 

Proof. Let us write for brevity y=l/3"'. We may suppose 
(by replacing, if necessary, the set £ by a subset of positive outer 
measure) that 0(1) > 0 for every interva) I containing points of E, 
which has diameter less than a positive number a<e. Let x 0 eE be 
a point of outer density for E and let J-=[a l , b x \ a 2 , ... ; a,„, fe m ] 

be an interval containing x 0 such that 
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(11.18) <5(J)<a, |.E?-t7|>(l — y)-|J] and G(J)<a-\J\. 

Let us denote by l the smallest edge of J and by » x the positive 
integer satisfying the inequality 

(11.19) n x Zs^6 x — « x < (n x +l) Z. 

Writing (6,^ — a x )/n x , let us subdivide the interval J into » x 

equal non-overlapping subintervals 

<//=[« x + (t 1) d x , }~ * d x j ••• 5 a m ,b m \ 

where i=l, 2, ... , n x . We shall call an interval J, of the first kind 

if \E-J\ > (1 — 3y)-|«7/|, and of the second kind in the opposite 

case. Denoting by a summation over the indices i corresponding 
( 

to intervals of the second kind, we see easily, on account of the 

v "i 

second of the relations (11.18), that 2! 3y-|J(!<y-|J|=^’y Hence, 

i /= i 

if p and q are the number of intervals Jt of the first and second 
kind respectively, we find 3q<n l =p+q. Now let us subdivide the 
interval J into a finite number of non-overlapping subintervals, in 
such a manner that each of these is the sum of a certain number 
of intervals J,- among which exactly one is of the first kind. Since 
2q < p, the intervals of this subdivision include some which coincide 
with certain intervals J,- of the first kind, and their number is at 
least equal to p — q > n 1 /3. Thus if we denote their sum by A, 
we find 

(11.20) |A|>|J|/3. 

On the other hand, the figure JO A consists of a finite number 
of non-overlapping intervals' each of which contains an interval J/ 
of the first kind, and therefore points of E. Consequently, 6{J © A) >0, 
and, on account of (11.18) and (11.20) 

G(A) <G(J) <a-\J\<3a-\A\. 

It follows that among the intervals Ji of the first kind of which 
the figure A is formed, there exists one at least, J/, say, such that 
G(JiJ < 3 <*• . 

Let us write, for brevity, ®?=« x +(* 0 — l)d„ 6®=®,+*,^, and 
J (,) = <7, „=[<**, 6®; a 2 ,fe 2 ; ... ; a m , ft J. By the above, J (1) C«7, G(J W ) < 3a- |J (,) I 

and, since J (>) coincides with an interval Ji of the first kind, 
\E-J W \>(1— 3y)-|J (1> ,; finally by (11.19), Z^fc®— a®=d,<21. 
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If we now operate on J (l) just as we formerly did on J (except 
that we replace y by 3 y, a by 3« and the linear interval [o x , 6 X ] by 
[o 2 ,ft 2 ]), we obtain an interval J <2) =[a°, a®, 5°; a 3 , 6 3 ; ... ; a m , bJC^ 

such that <7(J (2) )<3 2 -a-|J (2) |, and a ; °<21 

for ?=1,2. 

Proceeding in this way m times, we obtain after m oper- 
ations an interval J (m) =[a®, 5®; a®, 6®; ... ; C J such that 

\E-J im) \>(i—3 m y)-\J <m) \ and l<&»—a»<21 
for j=l, 2, ... , w. It follows that r( J <m) ) ^ 2 _m , and if we write 
Q—J (m) and substitute y=3~ m , we find at once that the interval Q 
fulfils the conditions (11.17). 

(11.21) Theorem. If F is an additive function of an interval, 
we have F'(x)=F s (x)^=oo [F'(x)=F s (x) 4= oo] at almost all the points 
at which F s (x) > — oo [F s (a;) < + °°3- 

Thus, in particular, the function F is derivable in the strong 
sense at almost all the points at which both the extreme strong derivates 
F-,(x) and F s (x) are finite. 

Proof. Since F(x)^F s (x) holds for all x, the function F is, 
by Theorem 11.15, derivable (in the ordinary sense) at almost all 
the points x for which Fs(x)> — oo, and we have only to show 
further that at almost all these points F'(x)~F s (x). Suppose there- 
fore that the set of the points x for which F'(x)~> F s (i r)> — oo is of 
positive measure. We could then determine a number a > 0 and a set 
B of positive measure such that F'(x) — F f (x)~>a at every point xeB. 

For brevity, write £=a-3 _(m+1) , and let B p denote the set of 
the points xeB for which pe <F s (x) ^.(p+l)-e. Let p 0 be an 
integer such that \B Pa > 0, and write G(I)=F(I ) — p 0 e ■ |/| 
(where I is any interval). Since G'(x) > G s (x)+a > a at every point 
x e B Pa , we can determine a positive number a and a set EC B Po of 
positive measure, such that G(Q)>a-\Q l whenever Q is an interval 
satisfying the conditions 

(11.22) d(Q)<o, r(Q) > 2~ m and EQ = f=0. 

But since 0 < G s {x) < 2t at every point xeECBp*, there 
exists by Lemma 11.16, an interval Q subject to the conditions 
(11.22) and such that G(Q) <3 m -2e-\Q\ <a-|Q|. We thus arrive at 
a contradiction and this proves the theorem. 
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*j( 12. A theorem of Hardy-Little wood. The theorem of 
Jessen, Marcinkiewicz and Zygmund concerning strong derivation 
of indefinite integrals, which was mentioned in § 10, p. 133, is 
connected with an important inequality due to G. H. Hardy and 
J. E. Littlewood [2], This inequality, which was established in 
connection with certain problems of the theory of trigonometrical 
series, thus obtains a new and interesting application. 

We reproduce in this § the elegant proof given by F. Riesz [5] (cf. also 
A. Zygmund [I, pp. 241 — 245]) for this inequality. Although simpler than the 
other proofs, it requires nevertheless some rather delicate considerations. Certain 
parts of the argument have been touched up in accordance with suggestions 
communicated to the author by Zygmund. 

The reasonings of this § concern functions of a real variable. 

(12.1) F. Riesz* s lemma . Let F(x) be a continuous function on 
an interval [a, 6] and Jc a finite number . Let E be the set of the points x , 
interior to the interval [a, 6], for each of which the inequality 
F(x) — F(u)> k-(x — u) is fulfilled by at least one point u subject to 
a < u < x. 

Then the set E is either empty , or else expressible as the sum of 
a sequence {(a n , b n )} of open non-overlapping intervals such that 
F(b n )—F(a n )^k-(b n — a n ). 


Proof. By subtracting from F(x) the linear function kx , we 
may suppose that k— 0. Then E is the set of all the points x of the 
open interval (a, ft), for each of which there exists a point u such 
that F(u) <CF(x) and a < u < x. Since the function F is continuous, 
the set E is clearly open, — and, unless empty, it is therefore ex- 
pressible as the sum of a sequence {(a„, b n )} of non-overlapping 
open intervals. We have to prove F(a n )^F(b n ) for each n. 

To see this, let us fix an index n and suppose that F(a n ) > F(b n ). 
Let h be any number such that 

(12.2) F(a n )>h>F(b n ), 

and let x 0 be the lower bound of the points x of the interval [ a nj b n ] 
for which F{x)—h. By (12.2) the point x 0 belongs to the open 
interval (a„, fc„), and so to the set E\ thus there exists a point y 
such that F(y)<F (x 0 )=h < F ( a n ) and a<y<x 0 . This last relation 
implies a < y < a„, since, by (12.2) and by the definition of the point 
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x 0 , the inequality F(y)<h cannot hold for any y of the interval 
[<*/!> *o]' Thus F(y)<.F(a„) and a<y<a„, and consequently a„eE ; 
but this is - clearly contradictory, since a„ is an end-point of one 
of the non-overlapping open intervals which constitute the set E. 

Besides the results treated in this §, many other applications of Lemma 12.1 
are given by F. Riesz [6; 7; 8], particularly in the theory of derivation of func- 
tions of a real variable. Cf. also S. Izumi [1], The lemma might also have 
been used in the considerations of § 9 (instead of appealing to the theorems of 
§ 8 on rectifiable curves). 

To shorten our notations we shall restrict ourselves in the rest 
of this § to functions defined in the open interval (0, 1); and we 
shall agree to write E[/ > a] for E [/(*)> a; 0 < x < 1]. The symbols 

X 

E[/^a], E[h>/>a] and so on, will have similar meanings. 

Two measurable functions g and h in (0, 1 ) will be called (in 
accordance with the terminology of F. Riesz) equi-measurable if 
mg > a]|=|E[ft > a]| for every finite number a. We see at once that 
we then also have 

m9>a]\=m h > a t E\[b>g^a]\=\E(b>h^a]\, etc. 

(12.3) If two non-negative measurable functions g and h in the in- 
terval (0, 1) are equi-measurable, their definite integrals over this in- 
terval are equal. 

To see this, let us associate with the function g a non-decreasing 
sequence {g n } of simple functions by writing g n (x)=(k — 1)/2" when 
(ft — l)/ 2 ' , <;< 7 (^c)<ft/ 2 ' , and ft=l,2,...,2"-n, and g n (x)=n when g(x)^n. 
Similarly with g replaced by h, we define the sequence {h n } con- 
verging to the function h. If we calculate directly the integrals 
of the functions g n and h n over (0, 1) by formula 10.1 of Chap. I, 
p. 20, we see at once from the fact that the given functions g and h 

i i 

are equi-measurable that J g n (x) dx— j h n (x) dx. Making n— >co, 

this gives Jg(x)dx=Jh(x)dx as asserted, 
o o 

If / is a continuous function in (0,1) which is not constant 
on any set of positive measure, and if m and M denote the lower 
and upper bound of / respectively, the function ?(y)=|E[/>y]| is 
evidently continuous and decreases from 1 to 0 in the open interval 
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(m, M). Its inverse function is therefore continuous and decreasing in 
(0, 1) and, as we easily verify, equi-measurable with the given function. 

We shall extend this process with suitable modifications, to 
arbitrary measurable functions finite almost everywhere in (0, 1). 

With any such a function / ( x ), we associate the function f*(x) de- 
fined for each x of (0, 1) as the upper bound of the numbers y for 
which |E[/>y]|>a;. The function f{x) is clearly finite and non- 
increasing in (0, 1). To show that this function is equi-measurable 
with f(x), let y 0 be any finite number, and let x 0 denote the upper 
bound of the set E[/“> y 0 ], or else x 0 =0 if this set is empty. Then 
since |E[/°> y 0 ]| = a? 0 , it has to be proved that |E[/ > y 0 ]| = x 0 . 

We have, in the first place, f a (x 0 +e)^.y 0 for every t>0 
(provided, of course, that £c 0 -|-c < 1), so that |E[/ > y 0 +f]| ^a? 0 + c > 
and therefore |E[/ > y 0 ]| On the other hand, f(x 0 — e) > y 0 

for every £>0 (provided that x 0 — £>0), so that |E[/>y 0 ]|> a: o — c > 
whence |E[/> y 0 ]|^«o» and finally |E [/ > y 0 ]| = a? 0 = E [/" > y 0 ]. 

We shall further define, in connection with any summable 
function f(x), three functions f'(x), f\x) and f(x). At any point x 
of (0, 1) we shall denote by f'(x) the upper bound of the mean 
values of / on the intervals (u, x) contained in (0,#), i. e. the 

upper bound of the numbers lf(t)dt for 0<w<£e. Simi- 

CO — w J 

u 

3 If 

larly, f ‘{x) will denote the upper bound of the means j f(t) dt 

X 

for #0<1. Finally, f(x) will denote the larger of the two num- 
bers f'(x) and f J (x), or what comes to the same, the upper bound 

of the means jf(t)dt where u and v are subject to the 

u 

condition 0<w<#O<l. 

(12.4) Lemma . If f(x) is a non-negative measurable function in 
the open interval (0, 1) and if E is a set contained in this interval , then 

\E\ 

j f(x) dx . 


j f(x) dx < 
e c 



A theorem of Hardy-Littlewood 


145 


[§ 12 ] 


Proof. Let f 1 be the function equal to / on the set E and to 0 
elsewhere. We evidently have f"(x)^.f a (x) at each point x of the in- 
terval (0,1). Furthermore f"(x)= 0 as soon as x>\E\. Therefore on 

1 1 |£| 

account of (12.3) we find f f(x)dx = J f 1 (x)dx=f f“(x)dx*^.j f(x)dx. 

k 6o o 


(12.5) Lent mu. If f(x) is a non-negative summable function in the 
interval (0,1), then for each point x of the interval, we have 

ff(t) dt. 

*6 

Proof. Let .r 0 be any point in (0,1), let y 0 =/ ,| “( a? 0 )> and 
let £ denote an arbitrary positive number. We write A = E[f 1,a >y 0 — £] 
and B=E[/ , ‘> y 0 — fj. Then since the function f' n is non-increasing, 
we have. \A\~^x 0 and therefore, remembering that the functions f' a 
and f' are equi-measurable, \B\=\A\^x Q . 

Now B is the set of the points x for each of which there exists 

X 

a point n subject to the conditions f f(t)dt>(y 0 — e)-(x — v) and 

u 

0 < w < .r. Therefore, applying F. Riesz’s Lemma 12.1 to the in- 
definite integral of /, we find easily that B is an open set and that 

ff(t)dt^(g o — ^ follows by Lemma 12.4 that 

B 

1 1 ,?l 

(12.6) yo—e<^jMdt^j2\lf(t)dt. 


Now since \B\^x 0 and since the function f" is non-increasing, the 
last term of (12.6) cannot exceed — //"(<) dt; and since t is an 


'•o 


arbitrary positive number, we must have f' a (x 0 )=y 0 ^ ! — I f‘(t)dt. 

x o 6 

This completes the proof. 


(12.7) Theorem of Hordy-Littleu'ood. If f(x) is a non-negative 
summable function in (0, 1) and e is a positive number, then 


i | i 

(12.8) ff\x)dx^A- 1 f(x)-log+ f(x)dx+B- 1 f(x)dx+e, 
o* (i 6 

where A and B are constants depending on e, bnt not on /. 
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Proof. We first evaluate the integral of f‘ over (0, 1). Ac- 
cording to (12.3) and Lemma 12.5 we have 

jf'(x)dx=jf' a {x)dx^j ft-^dy\dx. 
o o o o ® J 

In virtue of Fubini’s theorem the last member of this inequality 
is the surface integral of the function f{y)jx over the triangle 
0<«<1, ^ x. Therefore inverting the order of integration in 

this member, we find 

(12.9) / f(x) dx^flf H f(y) dy — |log dy. 

0 o J 6 

n 

Let now r\ < l be a positive number such that ^|log y| / \y dy < f/2. 

6 

Let us denote by E 1 the set of the points y of the interval (0, rj) at 
which f a (y)^.lfyy, and by E 2 , the set of the remaining points of 
this interval. We find 

(12.10) Jf(y)-\logy\ dy^ dy+2 j f(y). : logf‘(y) dy+ 

o *y 

1 1 i 

+ \\ogr]\.ff a (y)dy <2 f f(y)-\og+ f(y) dy -\-\\og rj\- j f a (y)dy+ f/2. 

tj o o 

Further, since the functions / and f are equi-measurable, so 
are the functions /-log + / and /“-log + /“, and it therefore follows 
from (12.9) and (12.10) that 

1 i i 

(12.11) J f'(x)dx^2 f f(x)’log+ f(x)dx+\logt]\’ J f(x)dx-\- f/2. 

oo o 

A similar inequality clearly holds when on the left-hand side 
of (12.31) f' is replaced by f% and on adding the two inequalities 

i i j i 

we find Jf(x)dxK 1 J f'(x) dx+J f\x) dx ^ 4jf(x)- log + f(x)dx+ 

000 o 

1 

-f |21ogi?|> Jf(x)dx+e-, this gives (12.8) with A = i and with B=2|log>?|. 
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*$ 13. Strong derivation of the indefinite integral. 

We proceed to prove the theorem of Jessen, Marcinkiewicz and 
Zygmund. We shall give the proof for the case of the plane; 
its extension to spaces R m of any number of dimensions (cf. §10, 
p. 133) presents no fresh difficulties and is effected by means of 
the well-known inequality of Jensen. 

We shall begin with some auxiliary remarks. Suppose given 
a non-negative function f{x, y) summable over the open square 
J 0 =(0, 1; 0, 1). By Fubini’s theorem, the function f(x,y) is sum- 
mable in x over (0,1) for almost all y of (0, 1). Denote by H the 
set of these values of y. For any yeH and for any x of the interval 
(0, 1), we shall denote (cf. §12, p. 144) by f(x, y) the upper bound 


of the mean 


1 


v — u 


O 

Jf(t, y) dt 

ll 


for 


0 < m < x < v < 1 ; and whenever 


yfCH, we shall write, for definiteness, f(x, y) = 0 identically in x. 
We shall prove that the function f(x, y) thus associated with any 
function f(x, y) which is summable over the open square <7 0 = (0,1 ; 0,1 ), 
is measurable. 


For this purpose, let a and b denote two positive numbers, 

.V-f b 

and write y)= / f(t, y)dt when ycH and — aCx-jb^l, 


and y„.h(*r, y) = 0 elsewhere in J 0 . We shall begin by showing that 
each of the functions g a ,b(x, !l) is measurable. By Lusin's theorem, 
or more directly by the theorem of Yitali-Caratheodory (Chap. Ill, 
§7), the function / is equal almost everywhere to the limit of a non- 
decreasing sequence [f n) ] of non-negative, bounded, upper semi-contin- 

x + b 

uous functions. Now, let us put <J { "\,(x. y)— f f {n) (t, y) <lt when 

x— a 

0 ^ — a < x -f- b ^ 1, and #)=0 elsewhere. As is easy to show 

(e. g. by means of Theorem 12.11, Chap. 1), each of the functions 
y) is then also upper semi-continuous, and since, as we readily 
see, (j a b (£j y)==\hn(/£ ) h (j',y) almost everywhere, the function g f ij,(*r,y) 

n 

is measurable. 


Finally, with the same notation as above, if !«,,} is the 
sequence of rational numbers of the interval (0, 1) we have 

/'V, !/) = upper bound g u >u (x, .</),(«„—«,,) 

P. <1 P q 
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at any point (x, y) of J 0 . Thus the function f(x, y) is also measurable, 
and this proves our assertion. 

(13.1) Theorem. If f(x,y) is a measurable function in the plane 
_/? 2 and if the function f log+ |/| is summable, then the indefinite integral 
of f is almost everywhere derivable in the strong sense . 

Proof. Clearly we need only consider the function / in the 
open square J 0 = (0, 1; 0, 1); and we may also suppose that this 
function is non-negative. 

We write g n (x, y) = f(x, y) wherever f(x,y)^n, and g n (x,y)—n 
wherever f(x,y)^>n} we write further h n (x, y) = f(x, y) — g n (x,y) 
and we denote by o an arbitrary positive number. The functions 
hi(x, y) are measurable and non -negative; so that by Theorem 12.7 
of Hardy-Littlewood, 

I fhi(x,y)dxdy <: 

(13.2) . •>„ 

fh„(x,y)- log+ h n (x,y) dxdy+B J fh n (x, y)dxdy+^a 2 , 

j u ./„ 

where A and B are finite constants depending only on o. And since 
the integrals on the right-hand side of (13.2) tend to 0 as n-> oo, 
there exists a positive integer N such that the left-hand side of (13.2) 
becomes less than a 2 for n = N. Therefore, writing for brevity 
h(x , y) = hx(x, y) and g(x , y)=g N (x , y ), we have 

(13.3) j j h d (x , y)dxdy <a 2 , 

' •>« 

so that in particular the function h"(x, y) 1 besides being measurable 
and non-negative, is summable on J 0 . 

Now denote by E the set of the points (^ 0 , y 0 ) of J 0 such that 

1° j h*(x 0 , t) dt<. + ^, and 2° the indefinite integral fh*(x Q ,t)dt 
6 o 

has at the point y = y 0 the derivative h^(x 0J y 0 ) with respect to y. 
Since, by Theorem 6.3, condition 2° is fulfilled for almost all y 0 
of (0, 1 ) provided that condition 1° is satisfied, it follows at once 
from Fubini’s theorem in the form (8.6), Chap. Ill (cf. also Theo- 
rem 6.7, Chap. Ill) that | J57|=|e/ 0 |=1. 

Let us write F, H and 0 for the indefinite integrals of the 
functions /, h and g , respectively, in J 0 . Let (^ 0 , y 0 ) be a point of the 
set E and I — [x 0 — n ly x 0 -\-u 2 ;y 0 — v v y 0 + v 2 \ any interval con- 
taining (# 0 , y 0 ) and contained in J 0 . We have 
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H(I) 

|/| 


f 2 +»x I [w 2 +«i f h{x 0 +u, y 0 +i>)dttj 

— *»i - u i 

1*2 

J h'( x o, y»+v)dv , 




whence making <5(Z)-*0 we obtain H .(x QJ y 0 )^ h?{x 0 , y 0 ). Thus, 
since (# 0 , y 0 ) is an arbitrary point of the set E C J 0 of outer measure 1, 
and since the extreme derivate H k (x 01 y 0 ) is measurable (cf. Theo- 
rem 4.2), it follows from (13.3) that 0^ H^(x, y)^H.(x, y)^.a 
at every point ( x , y) of J 0 , except at most a set of measure less than a. 
On the other hand, since the function g — g N is bounded, its in- 
definite integral G is, by Theorem 10.7, derivable in the strong 
sense almost everywhere. Therefore F^{x^y) — F^(x, y)^a at all but 
a subset of measure o of the points of J 0 ; and so finally, since o 
is an arbitrary positive number, F k (x, y)=Fs(x y y) almost every- 
where in J 0 , which completes the proof. 


By Ward’s Theorem 11.21, to prove that the non-negative function F is 
almost everywhere derivable in the strong sense, it is enough to show that 
F s (i)< + oo almost everywhere. Hence by using Theorem 11.21, the proof of 
Theorem 13.1 might be slightly shortened. 


* § 14. Symmetrical derivates. If 0 is an additive function 
of a set in a space Jt m , we shall denote by the upper , and 

by Dsw„ 0(x) the lower, symmetrical derivate of 0 at a point x , 
these being defined respectively as the upper, and as the lower, 
limit of the ratio 0(8) /IS | where S represents a closed sphere of 
centre x and of radius tending^to zero. It is obvious that, for any 
point x whatsoever D<P(®) ^ D sym 0(«) ^ Dm m ^ W ^ D0(®). 

Following A. J. Ward [5], we shall establish a decomposition 
theorem in terms of symmetrical derivates, which is similar to 
Theorem 9.6. We shall begin by the following “covering theorem”: 

(14.1) Theorem. If 0 is an additive function of a set in R„, and E 
a bounded set measurable (©), contained in an open set 0, then for 
any e > 0 there exists in 0 an enumerable sequence of closed sphe- 
res (5*) such that (i) the centre of each 8k belongs to E and the ra- 
dius is less than t , (ii) Sr 8j = 0 whenever i =(= j, and (iii ) the spheres 
8k cover together the whole of the set E, with the possible exception 
of a subset on which the function 0 vanishes identically. 
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Proof. We can clearly assume (by replacing, if necessary, 
the function 0 by its absolute variation) that the function 0 is 
monotone non-negative. 

a) We shall first prove that, with the hypotheses of the 
theorem, there always exists in G a finite system of equal sphe- 
res (iSy, which sasisfy the conditions (i) and (ii) and cover the 
set E except perhaps for a set TQ E such that 

(14.2) 0(T)^(l-lll' n+ 'm m )-0(E). 

To see this, let A be a subset of E, measurable (*©), such 
that 0(A) ^ .)■ 0(E) and e(A, CG) > 0. Let n 0 be a positive in- 
teger such that m/n 0 <i(>(A,CG) and m/w 0 < f. 

Denote by the net in the space R„„ which consists of the cubes 
of the form [pjn 0 , (p,+l)/n # ; p.Jn 0 , (p 2 +l)/n 0 ;...,pjn 0 , (p m -fl) 
where p v p 2 , p m are arbitrary integers. We can clearly sub- 
divide the net into (4 m) m families of cubes, ^ p ^ 2 ,...,^ (4m)/l , say, 
such that the distance between any two cubes belonging to the same 
family is not less than (4m— l)/n 0 . Denote, for each fc— 1,2, ...,(4m) /M , 
by 4 A the part of the set A covered by the cubes of the family k . 
Then there exists a positive integer A; 0 ^(4m)" ? such that 

(14.3) 0(Ak) > 0(A)/(4m) m > 0(E)/ 4" ,+1 to”. 

Now let P,, P- 2 , ..., P r be those cubes of which contain 
points of Ak„- With each Pi we can associate a closed sphere 
of radius m/n 0 , whose centre belongs to .4*. -P,-. The system of spheres 
Si, Si , ..., S r thus defined is contained in G and clearly satisfies 
the conditions (i) and (ii) of the theorem. Again, since P/C^/ for 
every t = l, 2, ..., r, the spheres Si cover the whole of the set E 
with the possible exception of the points of the set T=E~A ko , 
which, in virtue of (14.3), fulfils the condition (14.2) 

b) We now pass on to the proof of the assertion of the 

theorem. By what has already been proved, we can define by 
induction a sequence , 2 of finite systems of closed spheres with 

centres in E and radii less than e, subject to the following two 
conditions: 1° If B 0 denotes the empty set and B„, for 1, the 
sum of the spheres belonging to Si + S 2 + ... + S„, then the system 
S n +i, where n^> 0, consists of a finite number of closed spheres, 
contained in the open set G—B„, no two of which have common 
points; 2° 0(E—B n+t )^ (1-A m ) • 0(E~B n ) where A m =l/4" ,+1 m" 
and n= 0, 1, ... Now, arranging the spheres belonging to the family 
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6i+®2+ — + ®n+.** in a sequence {#,}, we see at once that the latter 
fulfils conditions (i) and (ii) of the theorem. On the other hand, by 2° 
we have 0(E — B n ) ^ (1 — h m ) n -0{E) for each n; whence, denoting 
by B the sum of all the spheres 8,, it follows that 0(E—B)= 0, 
which establishes condition (iii) and completes the proof. 

Theorem 14.1 may be established in a slightly more general form: 

Given a bounded set E measurable (33). a sequence of positive numbers {r n } 
converging to 0 and a family of closed sets '31, suppose that with each point x of E 
there are associated two finite numbers a—a(x), N—N{x), and a sequence {A n (x)) 
of sets (31) such that 8{x;rn)ClAn(x)£28(x;arn) for n > N(x). 

Then, for any sequence {dt,} of additive functions of a set, we can extract 
from % a sequence of sets { Ai—An i (x i )} such that (i) Xi t E for i=l,2, .... (ii) Ac A]— 0 
whenever i ± j, and (iii) the sets At cover the whole of the set E, with the exception 
at most of a set of measure zero on which all the functions 0 n vanish identically. 

(14.4) Lemma. If <P is an additive function of a set in R m , and if 
D sym ^(£c)>0 at each point x of a bounded set X measurable (53), 
then <P(X)^ 0. 

Proof. Let us denote, for every positive integer n, by X„ the set 
of the points xeX such that 0(S)^ 0 whenever $ is a closed sphere 
of centre x and radius less than 1 /«. Each set X n is evidently meas- 
urable (53), iD fact closed in X. Hence, for any e>0, we can associate 
with each X„ an open set G n Z) X n such that W(<P; G„ — X„)-^.e 
(cf. Theorems 6.9 and 6.10, Chap. III). Next, keeping n fixed for 
the moment, we can (on account of Theorem 14.1) define in G„ 
a sequence {#*} of closed spheres with centres in X„ and radii less 
than 1 /n, such that (i) S r Sj—0 whenever i =}= j, and (ii) the sphe- 
res 8m cover the whole of the set X„ with the exception at most 
of a set T on which the function ® vanishes identically. SinCe 
0(8*) 0 for every k, we find by (i) and (ii) that 0(X n ) ^ 

[W(0; T) + W(0',G„— X„)]^— e. Hence, as X = lim X„ and t 

n 

is an arbitrary positive number, it follows that 0(X)^O, which 
completes the proof. 

(14.5). Theorem. If 0 is an additive function of a set in R m , 
and if Aa, denotes the set of x at which one at least of the derivates 
Dsym 0{x) and D sym 0(x) is infinite, then for any bounded set X 
measurable (53), we have 

(14.6) 0(X)=0(X-A x )+fl)0(x)dx. 

X 

Consequently , if D sym 0(#)> — oo at every point x and D0(#)^O 
at almost every point x of a bounded set X measurable (©), then 
0(X)>O. 
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Proof. We firstly remark that if — oo<p M1 „0(,r) at each point x 
of a bounded set Q measurable (5?) and of measure zero, then 
0 (<*>)><). In fact, denoting for each positive integer n by Q„ the 
set of the points x of Q at which — /i<p. u ,,0(a;), and writing 
0„(A’)=0(A’) + n-LY|, we obtain p^,„0„(.r)>O at every point xeQ„. 
Hence, by Lemma 14.1, we must have 0(<2„)=0„(<?„)^O, and making 
n->oo we find 0(<?)^ 0. By symmetry we also have 0(P)^O when- 
ever Q is a bounded set measurable (53) of measure zero, such 
that D0(.r)<-f oo at each point x of Q. 

We pass on to the proof of formula (14.6). By Theorem 7.3, 
there exists a set A, measurable (53) and of measure zero, such that 
the relation 

(14.7) 0(A)=0(X-A) + [D0(x)dx 

x 

holds whenever A is a bounded set measurable (53). Since the set 
Aoo is of measure zero, we see at once from the equation (14.7) that 
the function 0 must vanish identically for all the subsets of Aoo—A, 
which are bounded and measurable (53). On the other hand, by what 
has just been proved, the function vanishes also for all subsets of 
A — Ao o. Hence the set Aoo may be taken in place of the set A in 
(14.7) and this gives (14.6). Finally, if p s ,,„0(;e)>— oo at every 
point x of a bounded set X measurable (53), then 0(A , A oo )^O 
and the second part of the theorem follows at once from the first. 

Let us mention the following consequence of Theorem 14.5: If at each 
point x both the symmetrical derivates of a given additive function of a set are finite , 
the latter is absolutely continuous. For ordinary derivates the corresponding pro- 
position has long been known (cf. IL Lebesgue (5, p. 423J) and is moreover 
included in Theorem 15.7 of this chapter, as well as in Theorem 2.1 of Chap. VI. 

*§15. Derivation in abstract spaces. With certain hypo- 
theses, a process of derivation may be defined for additive func- 
tions of a set in any separable metrical space, and for such a process, 
theorems similar to those of §§ 7 and 9 may be established. 

(15.1) Letnma. If 0 is an additive function of a set (53) on a metrical 
space M , then given any set X measurable (53) and any s>0, there 
exists an open set G such that 

(15.2) W(0; G — X)<e 


and 


W (0; X — G)<e. 
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Proof. Let 93 0 denote the class of the sets A' measurable (93) 
for each of which there exists, however we choose e>0, an open 
set G satisfying the relations (15.2). Since any closed set F is the 
limit of a descending sequence of open sets, we observe easily 
(cf. Theorems 5.1 and 6.4, Chap. I) that there exists for each e>0 
an open set such that W(0; G — F)<e. The class 93 0 thus 

includes all closed sets; to prove that 93 =93 0 , it suffices, therefore, 
to show r that the class 93 0 is additive. 

To do this, w r e choose e>0 and denote by X the sum of a se- 
quence (A’„}„-i. 2 ,... of sets (93 0 ). To each set X„ there corresponds an 
open set G„ such that W(0; G„ — A„)<e/2" and W(0; X n — G„)<e/2". 
Writing G—^G„, we clearly find that the inequalities (15.2) are 


satisfied. Therefore A'e93 0 . 

Again, suppose that e>0 and that A=CY, where l r f93 0 . 
There will then exist an open set H such that W(0; Y — H)<e/2 
and W ( ( P ; H — Y)<e. Consequently writing P=CH, we find that 
(15.3) W(0; P — X)<e/2 and W(0; X — P)<e. 

But since the set P is closed, there exists an open set G such that 
G^P and W (0; G — P)<e/2; and this implies, on account of (15.3), 
the inequalities (15.2) and so completes the proof. 


We shall call net in a metrical space M any finite or enumer- 
able family of sets measurable (9?) no two of which have common 
points and which together cover the space M. The sets constitut- 
ing a net will be called its meshes. A sequence (951,,) of nets will 
be termed regular , if each mesh of 901„ + i (where w>0) is contained 
in a mesh of 90f„ and if further d(901„) — >-0 as «->oo (where 4(951„) 
denotes the characteristic number of 95?„; cf. Chap. II, p. 40). It is 
easy to see that in order that there exist a regular sequence of nets 
in a metrical space, it is necessary and sufficient that this space 
be separable. 

In the rest of this § we shall keep fixed a separable metrical 
space M and we shall suppose given in M a regular sequence 901 = (901„) 
of nets and a measure ft which is defined for the sets measurable 
(93) and which is subject to the condition n{M)<-\-oo. Let 0 be 
an additive function of a set (93) on M. For x e M, where M is any 
mesh of a net 901„, let us write 


d„(x)= 


-f- oo 
— oo 


when 

when 0 and 0(3/) ^ 0, 

when 0 and 0(3/) <0. 
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The functions d n (x) are thus defined on the whole space M 
and are measurable (33). Let us write (p, 93l)D0(®) = lim sup d n (x). 

___ n 

The number (p, 93t)D0(a?) thus defined will be called upper derivate 
of the function 0 at the point x with respect to the measure p and the 
regular sequence of nets 991. Considered as a function of x, this upper 
derivate is clearly measurable (33). Similarly we define the lower 
derivate (p, ( 3Jl)T)0(x). If at a point x the two numbers (p, 93t)D0(a;) 
and ( p , 931)1)0 (a; ) are equal, their common value will be written 
{p, 93l)D0(a?) and called derivative of the function 0 at x with respect 
to the measure p and the regular sequence of nets 991. For the rest 
of this §, a measure p and a regular sequence of nets 931 will be 
kept fixed in the space M. 

(15.4) Lemma. Let 0 be an additive function of a set (33) on the 
space M. Then 

(i) if the inequality (p,iSl)T>0{x)^k, where k is a finite number, 
holds at every point xofa set A measurable (33), we have 0(A)^k-p(A); 

(ii) if at each point x of a set B measurable (33) and of measure (p) 
zero, the derivative (p,3Jl)D0(x) either does not exist or else exists and 
is finite, 0(B) = O. 

Proof, re (i). By subtracting from the function 0 the function 
k' p, we may assume that k= 0. Let e be any positive number. By 
Lemma 15.1 there exists an open set O such that 

(15.5) W (0; (? — J.)<e and W (0-, A — 0)<e. 

Let 9R, be the set of the meshes M of the net 931, such that 

(15.6) MQG and 0(M) ^ — e-p(M), 

and generally, for n>l, let 93) n +i be the set of the meshes M of the 
net 95ln+i which fulfil the conditions (15.6) and are not contained in any 
of the meshes of 991,+991,-f ...+9Jl n . By arranging the sets belonging 
to 951,+95l J -f-...+95l„+... in a sequence (if*), we have 0{M k )>—e-p(Mk) 
for k= 1, 2, ... , and A • (? G Jvif*. Since the sets if* are measurable (33) 

and no two of them have common points, it therefore follows 
from (15.5) that 0(A) = 0(A-G)+0(A — Q)>0{EM k ) — 2e^ 

k 

^ — e ’Lp(M k ) — 2e^ — c-[ j «(G t )-f 2], and so that 0(A)^O. 
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re (ii). Denote, for any positive integer n, by B n the set of 
the points x of B at which T>0(x)^ — n. On account of (i) we have 
0(B n )^ — n-/u(B n ) = 0 for each n, and, since B = lim B nj this gives 

0(B) >0. By symmetry 4>(JS)^0, and so finally 0(B) — 0. 

(15.7) Theorem . If 0 is a function of a set (33), which is additive 
on the space 3/, the derivative (/i,9Jl)D0(,r) exists almost everywhere 
and is integrable (33, /u) on 3/; moreover , if J5 +<x> and 2£-oo denote the 
sets of the points at which (//, 9Jl)D0(a?)= + °° and (//, 9)l)D0(a?) = — oo 
respectively , we 

(15.8) 0(Z)=0(A'.£ + co)+0(Z-^-oo)+/(Ai,9R)D<P(*)d^(jr) 
and 

(15.9) W(0; X)=0(X-E +oo )+\0{X^E- ao )\+f\{ f t,m)-D0(x)\dfi(x) 

x 

for every set X measurable (*©). 

Proof. By Theorem 14.6, Chap. I, there exist a function of 
a point / integrable (*$3, y) on M and an additive function of a set & 
singular (33, y) on M such that 

(15.10) 0(X) = S(X) + f f(x) dy(x) for every set X e 1 ©. 

,v 

Let E be a set measurable (93) such that y(E )= 0 and that 
the function S vanishes identically on C E. Writing, for brevity, 
D, D and D in place of (//,!R)D, (y, TR)D and (/<,TR)D respectively, 
let us denote for any pair of integers »>0 and k, by P„,* the set of 
the points x at which D0(x)^(k-{-l)ln>kln'^f(x). If we substitute 
P„,k-CE for X in (15.10), we find on account of Lemma 15.4 (i) that 

0(P n . k -CE)^^y(P„. k -CE)^- • y u(P„, A -CP)> (f{x)dy(x)=0{P n , h -CE), 
n n P,JCE 

and so that y(P n ,k)=y(Pn,k-QE)~ 0. Therefore D0(x) ^f(x) at 
almost all points x. By symmetry D0(x) ^ f (x) must also hold 
almost everywhere in M. Therefore the derivative D0(x) exists and 
equals f(x) at almost all the points x of M, and the identity (15.10) 
takes the form 

(15.11) 0(X)=8(X)+f D0d/u =0(E-X)+j D0d/i for every set Je93. 

x x 

Moreover, since D0(x)—f(x)^=oo almost everywhere, the set E +00 +E - „ 
is of measure (y) zero, and it follows directly from (15.11) that the 
function 0 vanishes identically on the set (E +ao +E^o) — E. 
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On the other hand, by Lemma 15.4 (ii), 0 vanishes identically 
on E — (E+oo+E.oo). Therefore in (15.1 1) the set E may be replaced by 
the set Z? + oo+25-oc, and the relation (15.11) becomes the required 
formula (15.8). Finally, since by Lemma 15.4 (i) the function 0 
is non-negative for the subsets (33) of E +«. and non-positive for 
the subsets (33) of E «>, formula (15.9) follows at once from for- 
mula (15.8). 

Let us mention specially the following corollary of Theorem 15.7: 

(15.12) Theorem, Suppose given in the space M two regular sequences of nets 
and Sp, and , as before , a measure p defined for the sets (33) and subject to the 
condition /*(>/)< -f oo. Then for every function 0 of a set (33), which is additive on >/, 
we have almost everywhere (,«, $l)D0{x) — {p, Sp)D^(jr); moreover , if E denotes 
the set of the points x at which either one at least of the derivatives (p, 9l)D#(a;) 
and ( p , ty)l)0(x) does not exist , or else both exist but have different values , then 
the function 0 vanishes identically on E , i. e. W (0; E) = 0. 

In fact, if we write, for brevity, D x and D 2 in place of (/#, 9l)D and (p, $)D 
respectively, and if we denote by the function of singularities of 0 f we have 
by the previous theorem 

0(X)=&(X)+)D l 0(x)dp(x)=to(X)+JD % 0(x)dp(x) 

X X 

for every set X measurable (13). Equating the two integrals which occur in this 
relation, we obtain almost everywhere D 1 0(x) — D 2 0(x). 

Now the set E of the points at which this relation does not hold, may be 
expressed as the sum of three sets A lt A 2 and A s , where A v is the set of the points 
x * E at which one at least of the derivatives D x 0(x) and D t 0(x) does not exist, 
or else exists and is finite, ^4 2 the set of the points x at which D x 0(x)~- foo and 
— oo, and A z the set of the points a; at which D x 0(x)~ — oo and D z 0(x) — -\- oo. 
It follows directly from Lemma 15.4 (ii) that the function 0 vanishes identically 
on A v In the same way, it follows from part (i) of this lemma that we have 
simultaneously 0(X)rO and 0(X) ^0, and so 0(X) = O, for every subset X 
measurable (33) of ^4 2 or of .1 3 . Consequently W(0; E)=0, and this completes 
the proof. 

Theorem 15.7, which corresponds, to a certain extent, to Theorem 9.6, 
was first proved by Ch. J. de la Valine Poussin [1; cf. also I, p. 103] for 
derivation with respect to the Lebesgue measure, and with respect to the regular 
sequences of nets of half open intervals in Euclidean spaces. Strictly, the Lebes- 
gue measure does not fulfil the condition which we laid down for the measure p, 
since Euclidean space has infinite Lebesgue measure. Nevertheless it is easy 
to see that for the validity of Theorem 15.7 (as well as for that of the other pro- 
positions of this §) it suffices to suppose only that the meshes of the nets considered 
have finite measure. 

For the derivation of additive functions of a set in abstract spaces, see 
also R. de Possel [1]. 
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* $ 16. Torus space. As an example and an application of the 
results of the preceding §, we shall discuss in this § a metrical space 
which, from the point of view of the theory of measure and inte- 
gration, may be considered as one of the nearest generalizations 
of Euclidean spaces. This space, called torus space of an in- 
finite number of dimensions, occurs in a more or less ex- 
plicit form in the important researches of H. Steinhaus [2], of 
P. J. Daniell [2; 3], and of other authors, in connection with 
certain problems of probability; but the first systematic study of 
this space is due to B. Jessen [2]. 

Following Jessen, we shall call torus space Q (l > the metrical 
space whose elements are the infinite sequences of real numbers 
£=(x 1J x 2J ... j % nj where for n = 1,2,..., the distance 

g(£, rj) of two points $={x v x 2 , ... , x nj ...) and rj=^(y v y 2 , ... , y n , ...) 
in Q i0 being defined by the formula p(f, rj)=y\y n — x n \f2 n . By Q m , 

n 

where m is any positive integer, we shall denote the half open cube 
[0, 1; 0, 1; ...; 0, 1) in the Euclidean space M m . If g=(x lJ x 2J ... , ...) 

is a point of ( r J w , we shall denote, for any positive integer m, by tj m 
the point (x l9 x 2 , ..., x m ) of (J my and by the point (#,„+!, x m + 2r ..) 
of Qtaj and we shall write f=(f m , f"). According to this notation, 
(1,7/) is a point of (J o> whenever f e<J m (where m is any positive integer) 
and rjeQoy. So that, if A(ZQ m and B C Q<o , the set A x B (cf. Chap. Ill, 
§§ 8, 9) lies in the space and in particular Q m xQto—Qo>. 

We shall call closed interval , or simply interval , in the space 
( % , any set of the form I x Q<o, where I is a closed subinterval of Q m 
for some value of m= 1, 2, .... Similarly, taking I to be an interval 
which is half open (on the left or on the right) in Q m , we define in 
the space Q<,> the half open intervals (on the left or on the right). 

Every (closed) interval J in Q w has only one expression of 
the form I x Q i0 where I is an interval in a space Q m . (It is to be re- 
marked that the space Q w itself is not a closed interval in the sense 
of the definitions given above.) By the volume of the interval J— I x Q M 
we shall mean the volume of the interval I in Q m QRm (cf. Chap. Ill, 
§ 2). Just as in Euclidean spaces, the volume of an interval J in Q i0 
will be denoted by \J\ or L W (J). Again, as in Euclidean spaces 
(cf. Chap. Ill, § 5), we shall extend the notion of volume in the space 
Qo> by defining for every set E in this space the outer measure L Z(E) 
of the set E as the lower bound of the sums where [J k) is 

k 

any sequence of intervals such that EC.2jJ Thus defined, the 
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outer measure evidently fulfils the three conditions of Carath4odory 
(cf. Chap. II, § 4) and determines, first the class of sets measurable 
(Cl* ), and then the class of functions measurable (Ci.;J. For brevity, 
the sets and the functions belonging respectively to these classes, 
will simply be termed measurable. Also by the measure of a set E 
in the space Q w we shall always mean its measure (L*). 

It is easily shown, with the help of Borel’s Covering Theorem, 
that the measure of any closed interval coincides with its volume 
(cf. Chap. Ill, § 5, p. 65), so that we can, without ambiguity, write 
\E\ or L t0 (E) (omitting the asterisk) to denote the outer measure 
of any set E in Qu>. We also see that the boundary of any closed 
interval is of measure zero. Finally, we remark that the whole space Qu, 
is of measure 1. 

We shall now 7 define in Q,., a regular sequence of nets (cf. § 15, 
p. 153) of intervals half open on the right. We shall, in fact, denote 
for any positive integer m, by Q <m> the finite system of 2"' intervals 
half open on the right 

[*,/ 2 m , (ft,+l)/2 m ; ft 2 /2"', (k 2 -\-l) 1 2 m •, ...; k m /2 m , (* m +l)/2 m ) X Q.„ 

where the fc ( are arbitrary non-negative integers less than 2 m . 
We see at once that each system Q < "‘ > is a net in Q lo . To see 
that the sequence of these nets is regular, we observe in the first 
place that each interval of Q <m H> is contained in one of the intervals 
of Q 1 "'*. On the other hand, no interval of the net Q ,m) can have 

m °° . 

a diameter exceeding the number Vl/2 m l/2 m , so that 

k~ 1 k — in - f 1 

the characteristic number zl(Q <m> ) of the net Q ( "° tends to zero as m —> oo. 

If x and y are two real numbers, ./; ! // will denote the number 
x+y — [#+y], where, as usual, [j?-f y] stands for the largest integer 
not exceeding x + y. If f=(a?„ ... , x. n ...) and rj^(y v y v ...) 

are two points of Q u > , we shall write for (^-f y ]} x 2 +y 2 , ..., x n +y 
The point clearly belongs to Q<>. 

We shall call translation by the vector a, where a is a point of Q< 0 , 
the transformation which makes correspond to each point ? of Q l0 
the point $+a. The translation by the vector a will be termed of 
order m, if all, except at most the first m, coordinates of a vanish. 

A function / in Q i0 will be termed cylindrical of order m y if /(f) 
does not depend on the first m coordinates of the point f, i. e. if 
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/(£)=/(£+“) identically in f, for every point a whose coordinates, 
except perhaps the first to, all vanish. A set E in Q 0 will be termed 
cylindrical of order to, if its characteristic function is so, or, what 
amounts to the same, if E=Q m xA where A is a set in Q a . 

(16.1) Theorem. A function which is measurable on Q 0 and cylindrical 
of every finite order, is constant almost everywhere, i. e. /(f)=c for 
almost all points f of Q 0 , where c is a constant. 

Proof. Suppose first that the function f is bounded, and 
therefore integrable, on Q w . Denoting by <P the indefinite integral 
of /, let us define, for each value of w and for each mesh Q of the 
net Q <m> , f m) (£)=0(Q)/L, J (Q) whenever £eQ. For every pair of 
meshes Q x and Q 2 of the same net Q (m) , there always exists a trans- 
lation of order to which transforms Q x into Q 2 i therefore, since the 
function / is cylindrical of order to, it follows that 0(Qi)=0(Q 2 )', 
and since further L M (Q 1 )=L W (Q 2 ), each of the functions / (m) (f) is 
constant on Q M . On the other hand, we deduce from Theorem 15.7 
that /(f) == lim / <m) (f) almost everywhere in Q 0 , i. e. that the function 

m 

is almost everywhere identical with a constant. 

Now let f be any measurable function which is cylindrical of 
every finite order. Let us write /„(f)=/(f) when !/(f)|^» and 
/(f)=« when |/(f)|>«. Each of the functions /„(f) is bounded and 
cylindrical of every finite order, so that by what has just been proved, 
each of these functions is constant almost everywhere. Therefore 
the same is true of the function /(f) = lim /„( f) and this completes 
the proof. " 

The fundamental properties of our measure in the space Q 0 
may be established by methods similar to those used in Euclidean 
spaces. To illustrate this, let us enumerate some of these properties. 

Given any measurable set E and any e>0, there exists a closed 
set F and an open set G such that F(ZE(2G and such that \G — E \ < e 
and | E — F\<e (cf. Theorem 6.6, Chap. III). 

From this we may deduce next Lusin’s theorem (cf. Theorem 7.1, 
Chap. Ill): If f is a finite function measurable on a set E, there exists 
for each e>0, a closed set FQE such that the function f is continuous 
on F and that \E — F|<e; and its immediate corollary: any function 
which is measurable in Q 0 , is equal almost everywhere in Q 0 to a function 
measurable (93). Finally Fubini’s theorem (cf. Chap. Ill, § 8) may 
be stated as follows for the space Q u : 
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(16.2) Theorem. If f is a non-negative measurable function in the 
space Q,o, then for any positive integer m, 

(i) the definite integral j /(£, exists for every f t Q m , 

o' 

except at most for those of a set of measure (L,„) zero, 

(ii) the definite integral ff(£,r))d L,„(|) exists for every y e Q (0 , 

Qn, 

except at most for those of a set of measure (L,„) zero , 


(iii) //(0dL w (0= /’[ ff(£,y)dL l0 ( V )]dL m (Z)= j[J'f(Z,ri)dLM)]dL„(r))' 


Proof. We begin by verifying this directly when / is the 
characteristic function of a closed interval, or of a half open interval, 
and then successively when / is the characteristic function of an 
open set, of a set (©,0, of a set of measure zero, and finally of any 
measurable set. It follows at once that the theorem is valid in the 
case where / is a simple function, and then, by passage to the limit, 
in the general case where / is any non-negative measurable function. 

The line of argument that we have sketched, does not differ substantially 
in any way from the proof of Fubini’s theorem for Euclidean spaces, and is even 
in a sense simpler than the latter, since in proving Theorem 8.1 of Chap. Ill we 
had to allow for the possibility of there being h.yperplanes of discontinuity of 
the functions U and V. 

In the space *},,> there is, however, as shown by B. Jessen [2, p. 273], 
another theorem of the Fubini type, whose proof requires new methods. This 
theorem allows integration over the space tt 0) to be, so to speak, reduced to 
integrations over the cubes Q m in Euclidean spaces, whereas each of the three 
members of the relation (iii) of Theorem 16.2 contains an integration extended over 
the space * 

(16.3) Jessen's theorem. If f is a non-negative measurable function 
in the space Q tli1 the integral 

(16.4) /„(£)= //(*, Qdh „,(£), where m= 1, 2, ... , 

d m 

exists , and we have 

( 16 .:») lim /,„(£)= /V(£)dL w (£), 

CL 

for almost all C in Q,.,. 
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Proof. Let us first remark that if Q is a set of measure zero 
in Qu,, it follows from Theorem 16.2, applied to the characteristic 
function of Q, that for any m whatever, the set E [(f, £<■ Q m ] 

is of measure (L m ) zero for almost all rj of Q 0 >. Hence (with the not- 
ation adopted p. 157) we also have L m (E[(£, £")€<?]}= 0 for almost all 

C of Qo>. It follows that if g and h are two non-negative measurable 
functions which are almost everywhere equal in Q a , the integrals 
j g(£, C”)dL m (£) and jh(£,C' m )dL m (£) are equal for almost all the C 
Q m <i m 

of Qo,, whatever m may be. We may therefore, without loss of gen- 
erality, assume in the proof of Theorem 16.3 that tue given func- 
tion / is measurable (■©); for any measurable function is almost 
everywhere equal to a function measurable ('B). 

The integral in the formula (16.4) then clearly exists for every f, 
and moreover it follows directly from this formula that the function 
/„(?) is cylindrical of order m. The upper and lower limits of 
the sequence {/ m (C)} are thus cylindrical of every finite order and 
by Theorem 16.1 we may write almost everywhere in Q,„ 

lim inf f m (£)=A and lim sup f m (£)=B 

m m 

where A and B are constants. It remains to be proved that A=M=B, 
where M denotes the integral on the right-hand side of (16.5). 

We shall prove in the first place that A^M. For this purpose, 
let A’ be any number exceeding A (if A — - f oo our assertion is 
obvious), and write 

m 

(16.6) P*=E[/*(CK4'], and S = \imS m . 

5 fc=l m 

The set 8 coincides, except for a set of measure zero, with the whole 
space Q„. Keeping an index m fixed ; let us evaluate the integral of 
f m overS m . Writing R m =P m ,R m -i=P m ~i CP m ,...,R l =P 1 CP 2 -... GP m , 
we have 

III 

(16.7) 8 m =£R), and (16.8) R r Rj = 0 whenever Kli<j^m. 

k~l 

On the other hand, since every function /* is cylindrical of order k, 
so are the sets P* and OP* and therefore the sets P* for &=1, 2, ... , m. 
We may thus write (cf. above p. 159) P*=Q*xP* where i?*C 
According to (16.6), we have /*(£) ^ A' for every f f P* (3 P* 
where k=l, 2, ... , m, or, what amounts to the same by formula (16.4), 
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for every r]eR k . Therefore, on account of 

% 

Theorem 16.2, we obtain for k= 1, 2, ... , m, 

f m dL u (C)= /[ fm V) dU(t) } dli w (r]) ^A’-U 0 (Rk)=A’-L u (R k ), 

** R k Q k 

whence it follows by ( 16. 7 ) and (16.8) that J /(C)dL, 0 (C )^4. ' • L w ( 8 m )^A 

S " 

Making m-+oo, we obtain in the limit M = f f (£) dL w (C) ^ A' and 
so M^A. Q u 

By symmetry M^B and, since it is clear that A^B, this 
requires A = M—B and completes the proof. 



CHAPTER V. 

Area of a surface * = F(x, y). 

§ 1. Preliminary remarks. We saw (cf. Chap. IV, § 8) that 
the Lebesgue theory enables us to solve completely the elementary 
problems concerning the length of a curved line and the expression 
of this length by an integral. However, similar problems concerning 
curved surfaces involve difficulties of a much more serious kind. 
Certain classical treatises on the differential and integral calculus, 
even in the second half of the XlX-th century, contain an inaccurate 
definition of the area of a surface. By analogy with the definition 
of length of a curve, the authors attempted to define the area of 
a surface as the limit of the areas of polyhedra inscribed in the 
surface and tending to it. H. A. Schwarz [I, p. 309] (cf. also 
M. Fr^chet [3]) was the first to remark that such a limit may 
not exist and that it is possible to choose a sequence of inscribed 
polyhedra whose areas tend to any number not less than the 
actual area of the surface. About the same time Peano and Her- 
mite subjected the old definition to similar criticisms and proposed 
new definitions based on quite different ideas. It was H. Lebesgue 
who first returned in his Thesis [1] to the old method, in a modified 
form that may be roughly described as follows: the area of a surface 
is the lower limit of the areas of polyhedra tending uniformly to 
the surface in question (without, however, being necessarily in- 
scribed in the latter). 

Nevertheless, in the more general case in which the surface 
is given parametrically, this definition requires various additional 
notions and considerations (cf. T. Radd [I; 1; 4]) and the results 
obtained are far from being as complete as those available 
for curves. The difficulties that arise belong to Geometry and 
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Topology rather than to the Theory of functions of a real vari- 
able. (For the special case in which the functions x=X(u, v), 
y=Y(u,v) and z=Z(u, v) which define the surface parametrically 
fulfil the Lipschitz condition vide T. Bado [4] and H. Badema- 
cher [4]). 

We shall therefore restrict ourselves to the case of continuous 
surfaces of the form z—F(x, y). The most elegant and the most 
complete results concerning these surfaces are due to L. Tonelli 
[5; 6; 7]; they will be given in § 8 and are the principal object of this 
chapter. 

Tonelli’s theory is based on the definition of area proposed by Lebesgue. 
As regards the modern work on area of surfaces based on other definitions, we 
should mention: W. H. Young [4], J. C. Burkill [3], S. Banach [5], A. Kol- 
mogoroff [3] and J. Schauder [1]. 

T. Bado fl, pp. 154 — 169; 2] has developed further the 
methods of Tonelli by means of older ideas due to de Geocze 
and with the help of certain functionals introduced by the latter. 
The principal result of Bado [vide Theorem 7.3), applications of which 
will be discussed below, enables us to define the area of a surface 
as the limit of certain simple expressions, whereas the Lebesgue 
definition only enables us to obtain it as a lower limit. An- 
other expression is due to L. C. Young (vide below § 8) and 
constitutes a direct generalization of the classical formula for the 
area of a surface. 

Except where the contrary is expressly stated, the reasoning 
of this chapter will be formulated for functions of two real variables. 
The extension to spaces of any number of dimensions offers no 
difficulty. 

$ 2. Area of a surface. By a continuous surface on a plane 
interval I 0 , we shall mean any equation of the form z=F(x, y), 
where F is a continuous function on I 0 . 

A continuous surface z=P(x, y) on an interval I 0 is termed 
polyhedron if there exists a decomposition of I 0 into a finite 
number of non-overlapping triangles T v T t , ..., T„ such that the 
function P is linear on each of these triangles, i. e. such that 
P(x,y)=a,x+b,y+c, for ( x,y)eTi , where i=],2, ..., n and «/, b if ct 
are constant coefficients. We shall call, respectively, faces and ver- 
tices of the polyhedron z=P(x, y), the parts and the points of the 
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graph (cf. Chap. Ill, § 10) of the function P, which correspond to 
the triangles T,, and to the vertices of the T,-. The sum of the areas 
of the faces in the sense of elementary Geometry, i. e. the number 

2W(a?+6?+l)*= ff[(9P/9x) 2 + {SP^yf+lfdxdy, will be called 

elementary area of the polyhedron z=P(x, y) on J 0 and denoted 
by S 0 (P; I 0 ). 

Given any continuous surface z=F(x,y) on an interval I 0 , we 
shall term its area on and denote by S (F;I 0 ), the lower limit of 
the elementary areas of polyhedra tending uniformly to this sur- 
face, i.e. the lower bound of all the numbers s for each of which 
there exists, given any e>0, a polyhedron z—P(x y y) on I 0 such 
that 1° | P(x,y) — F(x,y)\<e at every point (x, y) e I 0 and 
2° S 0 (P;I 0 )<*. 

We might verify Here that for polyhedra the elementary area agrees with 
the area according to the general definition just given. As, however, this is an 
easy consequence of the theorems given further on ( vide p. 181 ), a special proof 
is unnecessary at this point. It should be remarked that, in accordance with the 
definition adopted, the area of a surface may be cither finite or infinite. 

The following theorem is an immediate consequence of the 
definition. 

(2.1) Theorem. For any sequence of continuous functions {F„} 
which converges uniformly on an interval l 0 to a function F, we have 
liminf S(F„; J 0 )^S(F; I 0 ). 

n 

§ 3. The Burkill integral. Instead of treating the theory 
of area of surfaces by itself, it is more convenient to associate it 
with certain differential properties of functions of an interval. How- 
ever, the functions of an interval occurring in the theory of area 
are not in general additive, and in consequence we shall have 
to complete in some minor points the theory of functions of an 
interval, developed in the two preceding chapters. 

We shall begin with some subsidiary definitions. To simplify 
the wording we shall understand in the sequel by subdivision of 
a figure E 0 any finite system of non -overlapping intervals I ly I 2 , ••• > In 
such that R 0 =£lk- Given any function of an interval U and given 

a finite system of intervals 3 ={Ia}> we shall write, for brevity, U (3) 

in place of £U(Ik). In particular therefore, L(3) will denote the 

* 

sum of the areas of the intervals belonging to the system 3- 
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We call upper and lower integral in the sense of Burkill of 
a function of an interval U (I) over a figure i2 0 , and we denote by 

f V and J U respectively, the upper and the lower limit of 

R, R {l 

the numbers U( 3) for arbitrary subdivisions 3 of R 0J whose charac- 
teristic numbers 4(3) tend to zero (cf. Chap. II, p. 40). When 
these integrals are equal, their common value is called the Burkill 
definite integral (or simply the integral) of the function U over R 0 
and is denoted by J U. If this integral exists and is finite, the func- 

R o 

tion U is said to be integrable on R 0 (in the sense of Burkill). If the 
function V is integrable on every figure R (in the whole plane or 
in a figure 2? 0 ) its integral j U considered as a function of the figure 

R 

R is called indefinite integral of U (in the whole plane or on R 0 ). 

(3.1) Theorem 1° If U is a function of on interval and R x , R t 
are non-overlapping figures , we have 

(3.2) f U >f U +f U and f U <f U +f U ’ 

R{ Ri Ri-\-R-ji /?2 

provided that both integrals of V over are finite . 

2° Any function of an interval U which is integrable on a 
figure R 0J is equally so on every figure RQR 0 an d its indefinite 
integral on R 0 is an additive function of a figure . 

Proof, Part 1° of the theorem is a direct consequence of the 
definition of the Burkill integrals, and part 2° follows at once from 
the formulae (3.2) when we subtract the second of these formulae 
from the first. 

If 17 is a function of an interval on a figure R , we shall call 
variation of V on R at a set D the upper limit of 1 17(3)1 as 4(3) ->0, 
where 3 denotes any finite system of non-overlapping intervals 
contained in R and possessing common points with D. The follow- 
ing analogue of Theorem 4.1, Chap. Ill, may be noted. 

(3.3) Theorem . Given on a figure R 0 a function of an interval U 

such that j\U\< + oo, there can be at most an enumerable infinity of 
R« 

straight lines D , which are parallel to the coordinate axes and at 
which the variation of U on R 0 is not zero . 
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In fact, the number of straight lines which are parallel to the 
axis of x or of y , and at which the variation of U on R 0 exceeds 

a positive number e , cannot be greater than 2e ~ l • f \U\ < + oo. 

R 0 

(3.4) Lemma. Given a function of an interval U integrable on 

a figure R 0 , there exists , for each e > 0, an rj > 0 such that for every 
system ^={1^ I 2 , , I P ) of non -overlapping intervals situated in R 0 , 

the inequality J(3)<^ implies the inequality 

p 

(3.5) \2j[ U(I,>) -f U l\< e - 

*= i 

Proof. Let 7?>0 be a number such that, for every subdivision 
% of R 0 , A(%)<r) implies |f/(X) — f U\ < e/2, and let ^=[I lf I 2 , ... , I p } 

be any finite system of non -overlapping intervals situated in R 0 , 

p 

such that d(3) < r\. Let R t =R 0 Q JP/*. By Theorem 3.1, the function 

TJ is integrable on R v It follows that there exists a subdivision 3i 
of R t such that 

(3.6) 4(3!) <>? and \U(^)-[U\ < e/2. 

Now 3 + 3i clearly constitutes a subdivision of R 0 such that 
4(3 + 3i) <»?• We therefore have |Z7(3 + 3j) — / U\<e/2, and we need 

only subtract the second of the relations (3.6) from it to obtain (3.5). 

If R 0 is a fixed figure, then to any tj> 0 there corresponds 
a positive integer p such that every interval I(ZR 0 may Be sub- 
divided in p subintervals of diameter less than r\. Hence applying 
Lemma 3.4, we obtain at once the following 

( 3 . 7 ) Theorem. If a function of an interval TJ (I ) which is in- 
tegrable on a figure R 0 , is continuous, then the same is true of its in- 
definite integral B(R) = J TJ. 

H 

( 3 . 8 ) Theorem. If TJ is a function of an interval which is in- 
tegrable on a figure R 0 and if B is its indefinite integral , then 
B( x ,y)=U(x,y) and B(x,y)=U(x,y) at almost all points (x,y)eR 0 . 

In particular therefore, if one of the functions TJ and B is almost 
everywhere derivable in R 0 , the same is true of the other and the deriv- 
atives of TJ and of B are almost everywhere equal. 
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Proof. Suppose that the set of the points (x, y) at which 
V (x, y ) > B(x, y) has positive measure. We could then determine 
a set EQR 0 of positive outer measure and a number a> 0, such 
that U(x, y) — B(x, y)> a at each point (x,y) of E. Therefore, on 
account of Vitali’s Covering Theorem (Chap. IV, Theorem 3.1), 
we could determine in R 0 , for any rj > 0, a finite system of non- 
overlapping intervals 3 = !Ia}a=j, 2 ,..., « such thatd(3)<>?, L(3)>'12;/2, 
and U(Ik) — a'\Ik\ for fc=l, 2 , ...,«. Now it follows from 
the last two relations that f7(3) — .6(3) > a-\E\/2, which contradicts 
Lemma 3.4. Hence, U{x, y)<i B(x, y) almost everywhere in R 0 . 
In the same way we prove that the opposite inequality holds also 
almost everywhere in R 0 , and this completes the proof. 

(3.9) Theorem. Suppose that U is a continuous function of an 
interval on a figure R 0 and that (i) J |E7|<-foo and (ii) U (I)^U(Q) 

for every interval KZRq an d every subdivision 6 of I. Then the func- 
tion U is integrable on R 0 . 

Proof. Given a number e>0, let $ = , P be a sub- 

division of R 0 such that 

(3.10) U(1)>jU-e. 

H., 

Let us denote by D\, D-i , ... , D r the sides of the intervals (X) which 
do not belong to the boundary of R 0 . By Theorem 3.3 it may 
be assumed, in view of the continuity of the function U and of 
condition (ii), that the variation of U on R 0 vanishes at each side D t . 
It follows that there exists an g > 0 such that, given any finite 
system S of non-overlapping intervals situated in h‘ n and having 
points in common with the sides Z>„ the inequality A (S) < r\ im- 
plies |?7(6)|<e. We can clearly assume that r; does not exceed 
the length of any side of the intervals ($). 

This being so, consider an arbitrary subdivision 3 = {Ij, I 2 > ••• > In) 
of R 0 such that d(3)< rj. By numbering the intervals of 3 suitably, 
we may evidently suppose that 7j, I 2 , ... , 1,, are those having 
points in common with the sides D„ while the remaining intervals 
of 3 (if any) have none. Finally, let us agree to write U(JiQIk )— 0 

when «//©I* = 0. Then |j;f7(J*)|< e and \% £U(J, -)/*)| < e, so 

a- 1 , - 1 1, i 
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that, by (3.10) and by condition (ii) of the theorem, we have 

fU — e<V(%)^U(^)—ZU(Ik)+S %U(J,QI k )<U($) + 2e. It 

i *=• *=• 

follows that J U ^ J U +3e, and so, that j U = J U. 

Rq /?o Ro R(J 

In connection with this §, vide J. C. Burkill [2; 3; 4], R. C. Young [2] 
and F. Riesz [6; 7], 

# 4s Bounded variation and absolute continuity for 
functions of two variables* Given a function F(x , y) contin- 
uous on an interval I~[a ly b x ; a 2) & 2 ], ^ us denote for any value x 
subject to a x ^x<^b x , by W 1 (.F; x\ a 2 , b 2 ) the absolute variation 
of the function F(x , y) with respect to the variable y on the interval 
[a 2 , b 2 ]j and for any valuer subject to a 2 ^y^b 2J by W 2 (i^; y; a x , b x ) 
that of the function F(x,y) with respect to x on [a x ,b x ]. Denoting 
by J x and J 2 respectively the linear intervals [a xi b x ] and [u 2 , b 2 ] 
we shall also write W x (Fyx)J 2 ) for W X (J?; x\ a 2J b 2 ) and W 2 (F;y;J x ) 
for W 2 (f; y, a lt b, ,). 

By continuity of the function F, the non-negative expressions 
W x (F;x;J 2 ) and W 2 (F;y;J x ) are, as is easily seen, lower semi- 
continuous functions of the variables x and y respectively. When 

b x b 2 

the integrals fw x {F;x;J 2 )dx and j W 2 (jP; y\ J x )dy are both finite, 

Ox (l> 

the function F is said to be of bounded variation on I in the Tonelli 
sense. It follows at once that any function of bounded vari- 
ation of two variables x, y is of bounded variation with respect 
to x for almost every value of y and with respect to y for almost 
every value of x. 

A continuous function F{x,y) will be termed absolutely con- 
tinuous on an interval 7=[a 1? b x \ a 2 , b 2 ] in the Tonelli sense, if 
it is of bounded variation on I and moreover, absolutely continuous 
with respect to x for almost every value of y in [u 2 , 6 2 ], and absolute- 
ly continuous with respect to y for almost every value of x in [a x , b x ]. 

We say that a function F(x , y) fulfils the Lipschitz condition on J, 
if there exists a finite constant N such that \F(x\y')~ F(x ,, 1 y")\^ : 

y' — y"\) for every pair of points (x', y') and 

(»",*") of J. 

Any function which fulfils the Lipschitz condition on an 
interval I is evidently absolutely continuous on I. In particular 
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polyhedra and also functions of two variables with continuous 
partial derivatives, are always absolutely continuous functions. 

A function F which is continuous and of bounded variation 
[absolutely continuous, or subject to the Lipschitz condition] on an 
interval / 0 — t a iA> * 2 ] can easily be continued, even so as to 
be periodic, over the whole plane in such a manner as to remain 
continuous and of bounded variation [absolutely continuous, or 
subject to the Lipschitz condition] on every interval. In fact, de- 
noting by Z x one of the intervals congruent to J 0 with a common 
side parallel to the axis of a?, let us continue the function F from 
the interval Z 0 on to the interval I x by symmetry relative to the 
common side of these intervals. Let us further continue similarly 
the function F from the interval I 0 +Ii on to an interval Z 2 con- 
gruent to I 0 +Ii which has with the latter a common side parallel 
to the axis of y . The function F is then defined on the interval 
Zo+/i+/ 2 whose sides are respectively of lengths 2 -(b x — a x ) and 
2 (6 2 —a 2 ). Writing u=2(b 1 —a 1 ) and v = 2-(b 2 — a 2 ), and continuing 
the function F from the interval Z 0 +/ 1 -hZ 2 on the rest of the 
plane by the periodicity condition F(x+ u, y)—F(x y y+ v)—F(x, t/), 
we see easily that the continuation obtained for the function F 
has the properties required. 

Besides the definition of Tonelli several other definitions have been given 
of conditions under which a function of two variables is said to be of bounded 
variation. For a discussion of these definitions see C. R. Adams and 
J. A. Clarkson [1;2]. Throughout this chapter, use is made of Toneili’s 
concept only. 

We shall subsequently make use of the following theorem 
concerning the partial derivates of any continuous function: 

(4.1) Theorem . Given a continuous function F(x,y ), its partial 
Dini derivates , Ft, Fx , Ft, F~ and Ft, F u ,Fy,F^, are func- 
tions measurable (33). 

Proof. It will suffice to prove this for any one of these deriv- 
ates, say Ft. 

Given an arbitrary real number a, consider the set 

E= E [ Ft(x,y ) < a], 

U,y) 

and denote by E„ the set of all the points (x, y) such that for every h 
the inequality 0<fc^l/n implies [F(x-\-h,y) — F(x, y)]jh^a — 1/n. 
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We find that E—}jE„ and, since by continuity of the function F 

n 

each of the sets E n is closed, E is a set (5a), so that the derivate Ft 
is a function measurable (*33). 

Theorem 4.1 may be compared with Theorems 4.2 and 4.3 of Chap. IV 
concerning measurability of the derivates of functions of one real variable. Never- 
theless it is to be remarked that contrary to what occurs for functions of one 
variable, the partial Dini derivates of a function measurable (©) need not in 
general be measurable (S3), although they are still measurable (0) (the proof of 
this requires, however, the theory of analytic sets; vide F. Hausdorff[II, p. 274], 
M. Neubauer [1] and A. E. Currier [1]). On the other hand, a function of two 
variables may be measurable (£) without its partial Dini derivates being so. 


$ 5. The expressions of de GeScze. We shall make 
correspond to any function F{x, y) which is continuous on an interval 
1= [«i, b 2 ; a 2 , b 2 ], the following expressions introduced by Z. de 
Geocze [1] into the theory of areas of surfaces: 


b, 


I)= f | F(x, b 2 ) — F(x, a 2 )\dx, G 2 (F; 1)=/ 1 F(b v y)-F(a 1 ,y)\dy , 

a x a. 

G (F; /)={[G 1 (P; I)] 2 +[G 2 ( F; /)f+|Z| 2 } ,/2 . 

While studying the fundamental properties of these expressions, 
we shall often find the following two inequalities useful: 


(5.1) 


" \i . . £ x2,l/2 


1(2’®/) +(23//) +(2*/) 1 <2(®?+y 2 + 2 /) 

1=1 1=1 1=1 1=1 


1/2 


for any three sequences {»/}, {yi) and {z/} of real numbers ; 

(5.2) [_[J xdt ) +[Jy dt ) +(/ zdt ) ] (x t +y i +z i )'" 1 dt 

BEE E 

for any measurable set E in a space lt„, and any three non-negative 
functions x(t), y(t) and z(t), measurable on E. 

The inequality (5.1) is easily deduced by induction from the 
case n — 2 which can be verified directly. The inequality (5.2), in 
the special case in which the functions x(t), y(t), z(t) are simple, 
is an obvious consequence of (5.1); and we pass at once to the 
general case with the help of Theorems 7.4 and 12.6 of Chapter I. 
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(5.3) Theorem. The expressions of de Geocze G 1 (I)—G 1 (F', I), 
G 2 (I)~ G 2 {F] 1) and G(I)=G (F-, I), associated with a continuous 
function F(x, y), are continuous functions of the interval I and their 
integrals over any interval exist (finite or infinite )\ these integrals 
over any interval / 0 = [ a i> fulfil the following relations : 

b, b. t 

(5.4) J G 1 =Jw i (F-,x;a 2 ,b 2 )dx and J G 2 =J’W 2 (F-,y,a l ,b l )dy 

f o Hi Iff fl-t 

(5.5) ^(ZoK/e,, G 2 (I 0 )^J'G 2 and <?(Z 0 )</<?, 

/o h h 

(5.6) fGt^fG^[G l +fG t + 1/ 0 | where i= 1,2. 

/« h A) h 

Proof. Given an arbitrary e>0, let t]<ie be a positive number 
such that, for any pair of the points (x, y x ) and (x, y 2 ) in J 0 , 

(5.7) \y 2 — y x \<ri implies \F(x, y 2 )—F{x, y 1 )\<e. 

Let M denote the upper bound of F(x, y) on I 0 and consider 
in / 0 an interval 2=[a u /? x ; a 2 , fi 2 ] such that I <1 rf. We then 
have either /?j — a, < ij or fi 2 — a 2 <rj. In the former case, we find 
(? 1 (7)^(^ 1 — a 1 )-2M<2Mrj^.2Me, and in the latter we derive 
from (5.7), G^I)^ (/?, — a 1 )-£^(6 1 — a 1 ) • e, so that in both cases 
G 1 (I)^(2M + b 1 — a 1 )-e. The function G x (l) is therefore continuous. 
The same is of course true of G 2 (I) and the continuity of these 
two functions at once implies that of G(I). 

This being so, we shall show that the functions (? x and G 2 are 
integrable and, at the same time, we shall deduce the formulae (5.4). 

Let {3nr be a sequence of subdivisions of I 0 such that 
lim A (3 n ) = 0 and lim G,(3 n )= / G t ‘, and denote, for any positive 

/I n — 

/. 

integer n and any |e[aj,6j], by F„(|) the sum of the absolute 
increments of the function F(£,y) on the linear intervals cut off 
on the line x=f by the rectangles of the subdivision 3„. We then 
have, on the one hand, 

*, 

(5.8) e' 1 (3„)=_ fv„{S)dS for »= 1 , 2 ,..., 

71 , 

and on the other hand, on account of continuity of the function F , 
lim V„(£)=W 1 (F; £; a 2 , b 2 ) for any f e [a u 6 X ]. Therefore, in virtue of 
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Fatou’s Lemma (Chap. I, Theorem 12.10) and by (5.8), we obtain 

6 , 

jQi > /Wi(jP; £; a 2 , 6 2 ) df . But since G t 1 (3)^/w i (J 7 ;f;a 2 ,ft 2 )df for 
Jo 

— ft, 

every subdivision 3 of 7 0 , we have also j 0^. J W 1 (F; £; « 2 , ft 2 ) d£. 

h a i 

Therefore the function 0 t has a unique integral over I 0 and this 
integral fulfils the first of the relations (5.4). The existence of the 

integral J <? 2 and the validity of the second of these relations are 

h 

deduced by symmetry. 

Let us pass on now to the function 0. We first remark that 
the integral Jo clearly exists in the case in which one at least of 

the integrals f O t and J 0 2 is infinite, and is then also infinite 

i h 

on account of the relations 

(5.9) (?!(/)< <?(/) and 0 2 (I)^0(I) for any interval I . 

In the remaining case, the two integrals in question being finite, 
the evident inequality G(I) ^ 0 1 (I) + (? 2 (7) + |Z| yields 

< 5 - 10 ) fG< ; f0 1 +fO i + \Io\<+oo-, 

*0 h /o 

and on the other hand, for every subdivision 3 of any interval 7 
the equally obvious relations 

(5.11) ^(7X^(3) and G 2 (7)<(? 2 (3) 
lead, in view of the inequality (5.1), to 

(5.12) 0(Z)«?(3). 

Now, continuity of the function 0 being already established, 
the formulae (5.10) and (5.12) imply, by Theorem 3.9, that this 
function has over 7 0 a unique integral. 

To complete the proof we need only remark that the formulae 
(5.11) and (5.12) imply at once the formulae (5.5) and finally that 
formula (5.6) follows directly from (5.9) and (5.10). 

As a corollary of Theorem 5.3, and more particularly as a con- 
sequence of the formulae (5.4) and (5.6), we have: 
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(6.13) Theorem. In order that the function of an interval G(I)=G(F\I), 
associated with a continuous function F(x, y), be integrable on an in- 
terval I 0 (t. e. in order that J (?<+ oo), it is necessary and sufficient 

i . 

that the function F(x, y) be of bounded variation on I 0 . 

§6. Integrals of the expressions of de Geftcze. Given 
a continuous function F(x , y), we shall denote, for any interval I o, 
by H 2 (F; I 0 ) and H(.F; Z 0 ) respectively, the integrals of 

the functions of an interval G 1 {I)=G i (F; I), G 2 (I)—G 2 (F; 1) and 
G(1)=G(F; I) over the interval I 0 . All these integrals exist on 
account of Theorem 5.3 and their importance in the theory of area 
of surfaces is due to the fact that, as will be shown in the next §, 
the number H(F; Z 0 ) coincides with the area of the surface z=F(x, y) 
on Z 9 . 

(6.1) Theorem . For any function F(x,y) which is continuous and 
of bounded variation , the expressions ff 1 (I)—H 1 (F;I), H 2 (I)=H 2 (F;I) 
and ff(I)=H(F; I) are additive, continuous, and non-negative func- 
tions of the interval I, and we have at almost all points ( x,y ) of the 
plane 

(6 2) y)=\F„(x, y)\, H'i(x, y)=\F' x (x, y)\, 

H'(x, y) = {[FAx, y)f + [F y (x, y)f + U 5 • 

Proof. Additivity and continuity of the functions in question 
follow at once from Theorems 3.1 and 3.7 on account of Theorem 5.3. 
We have therefore only to establish the relations (6.2). Now, for 
any interval /=[«,, b 2 , a 2 , 6 2 ] we have according to Theorem 5.3 
and Theorem 7.4 of Chap. IV, the following relation (in which the 
transformation is effected in accordance with Fubini’s Theorem 8.1, 
Chap. Ill, rendered applicable to the partial derivates of the func- 
tion F(x,y) by Theorem 4.1): 

b, b, b, 

/W X (F; x; a 2 , b 2 )dx^f [f\F„(x, y) | dy] dx=f f \F y (x, y)\ dxdy, 

a x a, a 3 I 

whence 

(6.3) H\(x, y)^\F' y (x, y)\ for almost every point (x, y). 

Let us now denote by { J„) the sequence of the linear intervals 
with rational extremities. In view of Theorem 7.4, Chap. IV, we 
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have for »=1, 2, ... and for every linear interval J, 

fw 1 {F;x-,J„)dx=E l (J xJ„)^f j E',(x,y)dxdy= j [ f E\(x , y)dy]dx , 
J JxJ n J J n 

and consequently, for each positive integer n, the inequality 
W^-F; X) J„)^j E\(x, y)dy holds at every point x, except at most 

K 

those of a set E„ of linear measure zero. Therefore, writing E—^E„, 

n 

we obtain the inequality W^F;®;^)^ J H\(x, y)dy, whenever J has 

rational extremities and x lies outside the set E of linear measure 
zero. If we now regard the two sides of this inequality, for a given 
value of x outside the set E, as functions of the linear interval J, 
we obtain by derivation with respect to this interval (on account 
of Theorem 7.9, Chap. IV) for almost all y, the inequality 

(6.4) \F' y (x, y)\^H'i(x, y). 

Therefore, since the derivatives E\{x,y) and F'„(x,y) are meas- 
urable (cf. Theorem 4.1), it follows from Fubini’s theorem (in the 
form (8.6), Chap. Ill) that the set of the points (x, y) at which the 
relation (6.4) is not fulfilled, is of plane measure zero. By (6.3) we 
therefore have almost everywhere the first of the relations (6.2). 

The proof of the second relation now follows by symmetry, 
and that of the third from the remark that if we write G 1 (I)=G 1 (F 4 , 1 ), 
0 2 (I) = G 2 (F;I) and G(I)=G(F-, I), we have by Theorem 3.8, 

E'(x, y)=G'(x, y)=mx , y)] 2 +[^(®, y)] 2 +l}*= 

= {[E\(x, y)P+[ E' 2 (x,y)T+l)l={[F y (x, y)f+[F' x (x, y)] 2 +l}* 


at almost every point (x, y) of the plane. This completes the proof. 


(6.5) Theorem . In order that the function of an interval H (I)=H(jF;1), 
corresponding to a continuous function F(x,y) of bounded variation 
on an interval / 0 =[ a i> &i; a 2 > ^ 2 ]* be absolutely continuous on this 
interval , it is necessary and sufficient that the function F(x f y) itself 
be absolutely continuous ; and when this is the case , we have 


E(1 0 ) = ffm X, y)f + [F y (x, y)f + 1}' /2 dxdy. 


/. 


( 6 . 6 ) 
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Proof. By Theorem 5.3, absolute continuity of the function 
H(I) is equivalent to absolute continuity of the functions S 1 (I) 
and H 2 (I) together. 

Therefore if the function H is absolutely continuous on I 0f 
we have, by Theorem 6.1, for any interval £; «», b 2 ], where 

the relation 

t £ 

/Wi(P; x) a 2 , b 2 ) dx=H 1 (I$)=f j'F u (x, y)\ dx dy=J \F y (x, y )| dyj dx, 

0, 1^ a, a, 

and, taking the derivative with respect to |, we obtain for almost 
every value of x, 

(6.7) W \(F- t x ; a 2 , b 2 )=J | F'„(x, y)\ dy. 

a, 

Now, for any given value of x (for which F(x,rj) is of bounded 
variation in rj) the difference x; a 2 , rj) — J \F y {x, y)\ dy is a non- 

<h 

negative and non-decreasing function of the variable rj (cf. Theo- 
rem 7.4, Chap. IV). It therefore follows from (6.7) that we have 
for almost every value of x, and for any rj e [ a 2 , ft 2 ], 

v 

W X (P; X] o a , ??)= f | F u (x, y)| dy, 
a % 

1. e. that the function W l (F-, x; a 2 , rj), and consequently also F(x, rj), 
is absolutely continuous with respect to rj on [a 2 , 6 a ] for almost 
every value of x. By the symmetry of the variables, we conclude 
also that the function F(£,y) is at the same time absolutely contin- 
uous with respect to $ on [a 1( for almost every value of y. 
The function F, which is by hypothesis of bounded variation on I 0 , is 
therefore absolutely continuous in the Tonelli sense on this interval. 

Conversely, if the function F is absolutely continuous on h, 
we have by Theorems 7.8 and 7.9, Chap. IV, for every subinterval 
I—\ay, ft; a 2 , ft] of I 0 , the relations: 

/ W i(-F; a 2 , ft) dx= f f\F'y(x, y)\dxdy, 

“> 

(I)= JW 2 (F-, y; a v Px)dy=f f\F' x (x, y)\dxdy, 

«* / 

so that the two functions of an interval H 1 and H 2J and therefor© 
also H y are absolutely continuous. 
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Finally, since the function H is absolutely continuous, the for* 
mula (6.6) is a direct consequence of the third of the relations (6.2), 
the latter being valid almost everywhere on account of Theorem 6.1. 

Up to the present we have regarded the expression H(F;7) 
as a function of an interval 7. If we treat this expression as a func- 
tional depending on the function F, we obtain the following theorem, 
whose geometrical interpretation will appear in § 8 when the theorem 
appears to be a generalization of Theorem 2.1. 

(6.8) Theorem. Given any sequence of continuous functions {F„} 
which converges to a continuous function F, we have for every interval I 

(6.9) liminf H(F„; 7) > H(F; I). 

n 

Proof. Denoting by 3p the subdivision of 7 into p 2 equal 
intervals, similar to 7, we have by Theorem 5.3 for any pair of in- 
tegers p and n, H(F„; I)^G(F„; 3 P ), and by Fatou’s Lemma (Chap. I, 
Theorem 12.10), for every integer p, lim inf G(F„; %)^G(F; %). 

n 

We therefore have lim inf H(F„; J)^G(F; 3p), and this leads to 

n 

(6.9) when p —> oo. 

$ 7. Radd’s Theorem. Before passing to the proof of the result 
of Rad6, according to which the area of any surface z=F(x,y) 
on an interval I is equal to H(i^;2), we shall prove the following 

(7.1) Theorem . If a continuous function F(x,y) has on an 
interval I 0 —[ a i, &i5 ^ 2 ] continuous partial derivatives , there exists 

a sequence of polyhedra {z~P n (x, y)) inscribed in the surface z=F(x,y ), 
such that the sequence converges uniformly to this surface and such that 

(7.2) lim S 0 (P„; I 0 )= f f{[F' x (x, y)?+\_F y (x, y)] 2 +l) Wy=H(F; /). 

n ' 7 

to 

Proof. Let 3n={In,i, In, 2 , ...» I n ,n‘) denote the subdivision of I 0 
into n 2 equal intervals similar to I 0 , and (#„,„ y n j), where *=1, 2, ... , ft 2 , 
the lower left-hand vertex of I n j. Let us divide any interval I„,i 
into two right-angled triangles T' nii and Tji,/ by a diagonal, in such 
a way that the vertex (#„,„ y n<l ) is that of the right angle of T' n ,,. 
Consider for any n the polyhedron z=P„(x, y) inscribed in the surface 
z—F(x,y) and corresponding to the net formed on J 0 by the 2ft 2 
triangles T' n i and where i=l, 2, ... , ft 2 . 
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For brevity let h n =(b 1 —a l )jn and k n =(b 2 — « 2 )/n; and let 
fi„ denote the upper bound of the differences | F' x (x", y") — F' x (x', y')\ 
and | F' y (x", y") — F' g (x y')\ for all points (x', y') and ( x ", y") of I 0 
such that \x " — and | y" — y'\^k n . 

Now if 8„j and s^ t denote respectively the elementary areas 
of the faces of the polyhedron z=P„(x, y) which correspond to the 
triangles T‘„,t and T’ n , i, we notice at once that the areas of the pro- 
jections of the former of these faces on the planes xz and yz are 
respectively equal to 

\hn-\F{x n j, y„,i+k n ) — F(x„ t t, y n ,i)\=^h n k n -\F' y (x nth y nJ )\ 

and 

5“ k n ‘ | F {X n ,i~ f“ yn,l ) F ( X n , /, 2/n, /)j = 1 k n kn' |-F iy yn,i)\ 

where x n j^x n ,t^.x n<i -[-h n and y’ n j^ y n .i+k„. 

We therefore have 8'„,i=i{[F' x (x' n , h y nJ )f + [F' y (x„j, y’ nJ )f + l} 1 2 • 
and so, by the inequality (5.1), p. 171, 

n a n* 

2*n, i- | ^{[F’ x {x„, „ y n , df+mxn , „ y, M )] 2 +l} 12 -!I n ,/|<{-^„-W. 

<=1 /=l 

Since the partial derivatives F' x and F’ g are by hypothesis continuous, 
it follows by making n-*oo that 

/!' 

lim 2 s ’-‘=lf fmx,y)f+[F u (x,y)f+l)' 2 dxdy, 
f =* /» 

and the same limit is clearly obtained for the sum of the f . 
By addition, together with an appeal to Theorem 6.5, we now 
derive the formula (7.2) and this completes the proof. 

In what follows we shall apply the method of mean value 
integrals. Given in the plane a summable function F(x,y), the 

lnln 

sequence of functions F n (x,y)—n 2 j jF(x + u, y +v)dudv where 

0 0 

n— 1 , 2 ,..., will be called sequence of mean value integrals of the 
function F(x, y). It is clear that if the function F is continuous , 

(i) the sequence of its mean value integrals {F n (x,y)} converges to F(x, y) 
at every point ( x , y) of the plane , and uniformly on any interval , and 

(ii) the partial derivatives 2F n /2x and 2F n j2y exist everywhere and are 
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continuous. In fact, at any point (x, y) a direct calculation gives 

Vn 

9F n (x,y)j9x=n 2 J [F(x+l/n, y+v)—F(x, y+v)]dv 
o 

and 

1 In 

9F„(x,y)l9y=ri i f[F(x+u, y+l/n)—F(x+u, y)]du. 

6 

It was T. Had 6 [2] who first applied in the theory of area of surfaces the 
method of mean value integrals. The role of these mean values is due to the fact 
that in the case in which the given function F is continuous, the sequence of 
areas of the surfaces z~F n (x,y) on any interval tends to the area of the surface 
z=F(x,y) (vide, below. Theorem 7.3). 

In the definition given above, the functions F n are defined at each point 
(x, y) as “mean values” of the function F over squares of which (x, y) is a vertex; 
it goes without saying that we could also make use of mean values taken over 
squares, or circles, having ( x , y) as their centres. These mean values over circles 
are used for instance in potential theory (cf. F. Riesz [4] and G. C. Evans [1]). 

(7.3) Radb 9 8 Theorem. If F(x, y) is a continuous function and 
{F n (x,y)} is the sequence of mean value integrals of F(x,y), then 

(7.4) H(.F; I 0 ) = S( J P; / 0 )=lim S(F„-, I 0 ) 

n 

for every interval I 0 . 

Proof. Let { z=P n (x , y)} be a sequence of polyhedra converging 
uniformly to the surface z=F(x, y), such that 

(7.5) lim S q(P 1q)—^(F) I 0 ). 

n 

Since the functions P n (%, y) are absolutely continuous, it follows 
from Theorem 6.5 (cf. also §2, p. 165) that S 0 (P n -, I 0 ) — H(Pn'j I 0 ) 
for every n. Consequently, since the sequences of functions {jP„} 
and {P n } converge uniformly to the function F , it follows by using 
successively Theorem 2.1, the formula (7.5) and Theorem 6.8, that 

(7.6) lim inf S(P„; / 0 ) > S (P; I 0 )=Um H(P„; I 0 ) ^ H(P; / 0 ). 

n n 

Now if the function P is not of bounded variation on 7 0 , it 
follows from Theorem 5.13 that H(P; J 0 )= + oo and consequently 
the formula (7.4) follows at once from (7.6). We may therefore as- 
sume that the function F is of bounded variation on J 0 , and further 
(cf. § 4, p. 170) that F is continuous and of bounded variation on 
each interval of the plane. 
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Let us agree to denote, for any set E in the plane, by E (u ' v) the 
parallel translation of E by the vector («, v) (cf. Chap. Ill, § 11); 
similarly, for a family of sets G in the plane, (S (u,v) will denote the 
family of all the sets obtained from sets (G) by subjecting them 
to this translation. For any subinterval I=[a lt b 1 ; o 2 , fc 2 ] we then 
obtain 

Gi(E„; I)= J | F n {x, b 2 )—F„(x, a 2 )|do;< 

1 n 1 jn 

j [ j f\F(x-\-u,b 2 -\-v) — F(x+u, a 2 -\-v)\dudv^dx= 

a . 0 0 

1 n 1 n 

= n 2 ffG 1 (F-,I iu,u) )dudv, 

6 6 

and a similar formula for G 2 . Hence by the inequality (5.2), p. 171, 
G(F„; /)={[G t (F n ; Z)f+[G 2 (F„; Z)]HI| 2 }' 2 ^ 

^ nf /{[G 1 (F; / (u>w) )]' 2 +[G 2 (F; I {u - v) )?+\/ u ’ v) ?)' 2 dudv = 
o 6 

1 n l/ii 

= nf j G(F; I (u ' v) )dudv. 

6 6 

Denoting by % the subdivision of I 0 into p 2 equal intervals 
similar to I 0 , we obtain therefore, for every p, 

1 n 1 n 

G(F„; 3p)<» 2 / /'G(F; $“' v) )dudv, 

6 6 

and since by Lemma 3.4, G(F; 3p U ' u> ) tends to H(F; Io u ' v) ) as p->oo, 
uniformly in u and v, we obtain in the limit 

1 n 1 n 

(7.7) H(F„; I 0 )^n*f /H(F; ? 0 u ' v) )dudv. 

6 6 

Finally the areas of the figures and 1o ~:Iq u ' v) tend 

to 0 with u and v and each of these figures is a sum of two intervals. 
Hence since the expression H (F; I) is by Theorem 6.1 a contin- 
uous function of the interval I, we have lim H (F; lb "' u> ) = H (F; I 0 ). 

0, f )->0 

On the other hand, since the functions F„(x, y) have continuous 
partial derivatives, we have, by Theorem 7.1, S(F„; IoXH(F n ; Io) 
for each n. Therefore making «->oo in (7.7), we find lim sup S(F„;ZoX 

n 

^H(F;/ 0 ), which in conjunction with (7.6) gives the required 
relation (7.4). 
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§ 8. Tonelli’s Theorem. The theorem of Radd just established, 
enables us to replace in all the theorems of this chapter the ex- 
pression H(F;Z) by the surface area S(F;Z). 

Thus for instance, Theorem 6.5 (formula (6.6)) expresses the 
fact that the elementary area of a polyhedron coincides with its area 
according to the general definition of area of a surface . 

Theorem 6.8 contains a generalization of Theorem 2.1; it enables 
us to replace in its statement uniform convergence by ordinary 
convergence: we thus obtain a theorem similar to Lemma of Fatou 
(Chap. I, Theorem 12.10). It follows that the uniform convergence 
of the inscribed polygons, required in the definition of area, may 
be replaced by the ordinary convergence, so that the area of a contin- 
uous surface z=F(x,y) is the lower limit of the areas of polyhedra 
tending to this surface. Further, by Theorem 7.1, if a function F (x, y) 
has continuous partial derivatives , there exists a sequence of polyhedra 
inscribed in the surface z—F(x , y), tending uniformly to the latter 
and having areas which converge to the area of this surface. (For further 
generalizations vide S. Kempisty [1]. Cf. also on this subject 
H. Rademacher [3], W. H. Young [5], M. Fr6chet [2] and 
T. Rad 6 [5].) Finally, we obtain the following theorem, which 
sums up the most essential considerations of this chapter: 


(8.1) Tonelli’s Theorem . a) In order that a continuous surface 
z—F(x,y) have a finite area on an interval Z 0 , it is necessary and 
sufficient that the function F(x, y) be of bounded variation on I 0 . 


b) When this is the case , we have 


S(F;Z 0 )^ 


//[< 


— )+ 
9x) + 


(dF 


+1 


t 2 

dxdy\ 


the expression $(Z) — S(F; I) is then an additive continuous function 
of the interval ZCZ 0 an d we have for almost every point (x, y)el 0 


S’(x, y) — 



c) In order that we should have 

<«> 

/» 

it is necessary and sufficient that the function F(x,y) be absolutely 
continuous on Z 0 ; and in order that this be the case it is necessary and 
sufficient that the area 8(F;Z) be an absolutely continuous function 
of the interval I C lo- 
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Proof. The assertion a) follows directly from Theorem 5.13; 
b) and c) follows from Theorems 6.1 and 6.5 on account of Theo- 
rem 7.4, Chap. IV. 

With regard to Theorem 8.1 vide L. Tonelli [5; 6; 7]. The necessity of con- 
dition a) was established a little earlier by G. Lampariello [1]. 

According to Tonelli’s theorem, the relation of equality (8.2) 
can hold for a continuous surface z=F(x 1 y) only in the case in 
which the function F is absolutely continuous. Nevertheless, as 
proved by L. C. Young, this relation will remain valid for arbitrary 
continuous surfaces, as soon as we replace on the right-hand side 
the partial derivatives by ratios of finite differences and transpose 
the passage to the limit outside the integral sign. In fact: 


(8.3) Theorem . For any continuous surface z=F{x,y) and any 
interval I Q we have 


(8.4) 


S(F;Z 0 )=lim 



F(x+a,y)—F(x,y) 

a 


2 

+ 


+ 


F(x, y+P)—F(x,y) 


P 


+1 


| 1/2 

)dxdy, 


and in order that the function F be of bounded variation on I 0J it is 
necessary and sufficient that 


(8.5) lim sup rTTjoT 
«,p-m> M+IpI 


J f\F(x+a,y+p)—F(x,y)\dxdy 


<1 + °°. 


Proof. Let {Fn} be the sequence of mean value integrals 
(cf. § 7, p. 178) of the function F. Denote, for brevity, by 72(a?,t/; a,/?) 
the expression under the integral sign on the right-hand side of (8.4), 
and by R n {x , y\ a, /?), for each positive integer n , the expression 
obtained from R(x, y\ a, p) by replacing F by F n . Finally let us 
write for w=l, 2 , ... 

R„(x, y)=lim R„(x,y; a, P)={[dF n (x, y)jdxf +[9F„(x, y)/dyf+ l} 1 2 . 

In order to establish the identity (8.4), it evidently suffices 
to show that 


( 8 . 6 ) 

and 

(8.7) 


S(F; I 0 )s^lim inf f f R(x,y, a, fi)dxdy 

a, 1 3 -X) J / 

*0 


S(.F; Z 0 )|>lim 8U P f f R(x,y; a, P)dxdy. 

a,ft -+ 0 J j 
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For this purpose, let I be any interval in the interior of I 0 . 
By means of the inequality (5.2), p. 171, we easily find that 

1 In 1/u 

(8.8) R n (x, y, a, f})^n 2 f f R(x+u,y+v-,a, P)dudv. 

6 o 

Now let n be a positive integer, sufficiently large in order that 
(x,y)el, \u\<l/n and |v|<l/n should imply (x+u, y+v) e I 0 . 

We then ha,vef J R(x-\-u,y+v] a, P)dxdy^.j f R(x,y; a, I3)dxdy and 
1 . 

consequently, by (8.8), j j R n (x,y; a, (l)dxdy ^ j' j R(x,y, a, fi)dxdy. 

/ h 

Making a-*0 and /?->0, we obtain in the limit S (F n ; Z) ^ 

lim inf f jR(x, y; a, fi)dxdy. This relation being thus established 
*0 

for each interval I C.Io, we may replace, on its left-hand side, I 
by I 0 , and making still »->oo we obtain the relation (8.6). 

In order to prove the relation (8.7), let us first observe 
that the latter is obvious in the case in which S(F; I 0 )= + °o. 
We may therefore assume that the function F(x,y) is of bounded 
variation on I 0 and moreover (cf. § 4, p. 170) of bounded variation 
on every interval in the plane and periodic with respect to each 
variable. We can therefore determine an interval <I 0 = [ a i) V> a 2 > &a] 
containing I 0 in its interior, such that its sides iq — % and b 2 — o 2 
are the periods of F(x,y) with respect to x and y respectively. 

This being so, we find easily, on account of the inequality (5.1), 
p. 171, that, for any pair of positive integers n and k, 

A-l 

Rn(x,y ; a,P)^^^R„(x+jalk, y+j^k] a/k, Pjk). 

/=« 

By integrating the two sides of this inequality over J 0 , and taking 
account of the periodicity of the function F, we obtain 

f fR n (x,y, a,P)dxdy*C. f jR„(x,y, a/k, P/k)dxdy, 

•A) 4> 

and hence, passing to the limit, making first k-> oo, and then »-*oo, 
we find by Radb’s Theorem 7.3, 
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f J R(x, y; a, P)dxdy^liml J R„(x, y)dxdy=]imS>(F„;J 0 )=8(F-,J 0 ), 


and so 
(8.9) 


limsup / I R(x,y; a,P)dxdy^S(F‘,J 0 ). 


Now, by the result already established in the inequality (8.6), 
we have lim inf / / R(x, y; a, ft) dxdy ^S(F; I) for every interval I. 

It follows at once that (8.9) remains valid when we replace the 
interval J 0 by any subinterval of J 0 , and in particular by the 
interval Z 0 . We thus obtain the relation (8.7). 

Finally let us remark that on account of the relation (8.4), 
in order that the area of the surface e=F(x, y) on I 0 be finite, 
it is necessary and sufficient that 

lim supi / / ;F(aH-a, y)—F(x, y)\dxdy <+oo 
«-m> |a| J J 

h 

and 

limjaup| J \F(x, y+P)—F(x, y)\dxdy <+oo. 


Now this pair of relations is easily seen to be equivalent to the 
relation (8.5) which therefore expresses a condition necessary and 
sufficient in order that the function F should be of bounded 
variation on J 0 . This completes the proof. 

A statement analogous to Theorem 8.3 can be made for curves (cf. Chap. IV, 
§ 8). If C is a continuous curve defined by the equations x—X(t) f y= ¥ ( t) t its length 
on any interval I 0 =[a,b] is given by the formula 

(8.10) S (C-, /„) = Um J- 1 + - F( - t+ ^~ r(t) j |<tt. 

a 

In particular therefore, in order that a continuous functions G(t) be of bounded 
variation on an interval [a, 6], it is necessary and sufficient that 

b 

lim sup * / h)— 0{t)\dt < -boo. 

/i-*> \h\J 

a 


( 8 . 11 ) 
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This assertion can be proved by the method of mean value integrals in a man- 
ner quite similar to that we made use of in the theory of areas of surfaces z—F(x , y) t 
but for curves this method can be very much simplified. Let us observe further 
that the relation (8.11) may be interpreted in a more general sense. In fact, given 
any summable function G(t ), the relation (8.11) is the necessary and sufficient 
condition in order that the function G be almost everywhere on [a, 6] equal to 
a function of bounded variation (vide G. H. Hardy and J. E. Littlewood [1]; 
cf. also A. Zygmund [I, p. 106]). 

Finally the relation (8.10) holds for any rectifiable curve given by the 
equations x=X(t), y~Y(t) t where the functions X(t) and Y(t) are not necess- 
arily continuous, provided however that for each t the point (X(t), Y (t)) lies 
on the segment joining the points (X(t — ), Y (t — )) and (X(l-f), Y (1 4-))- 
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Major and minor functions. 

§ 1. Introduction. Major and minor functions (defined in § 3 
of this chapter) were first introduced by Ch. J. de la Valine Poussin 
in his study of the properties of the Lebesgue integral and those 
of additive functions of a set. Entirely equivalent notions (of 
“Ober”- and “Unterf unktionen”) were introduced independently 
by O. Perron [1], who based on them a new definition of integral, 
which does not require the theory of measure. Although, in its original 
form, this definition concerned only integration of bounded functions, 
its extension to unbounded functions was easy and led, as shown 
by O. Bauer [1], to a process of integration more general than 
that of Lebesgue. Moreover, as we shall see in § 6, the Perron integral 
may be regarded as a synthesis of two fundamental conceptions of 
integration: one corresponding to the idea of definite integral as 
limit of certain approximating sums, and the other to that of 
indefinite integral understood as a primitive function. 

It is usual to associate these two conceptions of integration 
with the names of Leibniz and Newton. In accordance with this 
distinction (which is largely a matter of convention) we shall call 
a function of a real variable F indefinite integral, or primitive, of 
Newton for a function /, if F has everywhere its derivative finite and 
equal to /. The function / will then be termed integrable in the sente 
of Newton, and the increment of the function F on an interval I 0 , 
will be called definite integral of Newton of / on I 0 . As is seen im- 
mediately, this definition implies that any function which is integrable 
in the sense of Newton is everywhere finite. This restriction is 
essential (cf. the example of § 7, p. 206) for the unicity of integration 
in the sense of Newton, which then follows from classical theorems of 
Analysis, or, if we like, from Theorem 3.1, or from Theorem 7.1 
of this chapter. 

The theory of the integral was first developed on Newtonian lines. 
This is easily accounted for if we think how much simpler the inverse 
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of the operation of derivation must have seemed than the notion of de- 
finite integral as defined by Leibniz. It was A. Cauchy [I, t. 4, p. 122] 
who returned to the idea of Leibniz in order to apply it to integration 
of continuous functions, for which the methods of Cauchy and 
Newton are actually completely equivalent. This equivalence dis- 
appears, however, as soon as we pass on, with Riemann, to inte- 
gration of discontinuous functions. In fact, even in the domain 
of bounded functions to which the Riemann process applies, there 
exist on the one hand (as we see at once) functions which are inte- 
grate in the sense of Riemann but have no primitive, and on the 
other hand (as shown by V. Volterra [1]; cf. also H. Lebesgue 
[II, p. 100]) functions which have a primitive but are not integrable 
in the Riemann sense. Also the Lebesgue process of integration 
does not include the integral of Newton, not even when the func- 
tions to be integrated are everywhere finite. 

Thus, the function F(x) — x* sin (/r/x a ) for x + 0, completed by writing 
F (0)= 0, has in the whole interval [0, 1] a finite derivative which vanishes for a;=0 
and which is bounded on every interval [e, 1], where 0<e<l. On every interval 
0, 1] the function F(x) is therefore absolutely continuous. On the other hand, 
on the whole interval [0,1] the function is not even of bounded variation. Hence 
F'{x) is not summable on [0,1], since its indefinite Lebesgue integral could 
then differ only by an additive constant from F(x) on [0,1], and this is im- 
possible. 

We have thus been led to the problem of determining a process 
of integration which includes both that of Lebesgue and that of 
Newton. As an application of the method of major and minor func- 
tions, we shall consider in this chapter (§§ 6 and 7) the solution of this 
problem constituted by the Perron integral. Another solution, the 
Denjoy integrals, will be treated in Chapter VIII. 


The notions of major and minor functions, and their applications to Le- 
besgue integration, will be discussed here for arbitrary spaces R m . In defining 
the Perron integral, however, we shall limit ourselves to functions of one real 
variable. Although recently various authors have treated the extension of this 
integral to Euclidean spaces of any number of dimensions, the present state of 
the theory does not allow us to decide as to the importance of this genera- 
lization. On the contrary, in the domain of functions of a real variable, the 
method of major and minor functions as a means of generalizing the notion of 
integral has already repeatedly shown its fruitfulness. In the memoir of 
J. Marcinkiewicz and A. Zygmund [1], the reader will find new applications 
of this method in connection with certain fundamental problems of the theory 
of trigonometrical series (cf. also J. Bidder [ll]h 
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§ 2. Derivation with respect to normal sequences 
of nets. Given a regular sequence of nets of intervals 

(vide Chap. Ill, § 2) in a space JR m and a function of an interval F 
in we shall call upper derivate of F at a point x with respect to 
the sequence of nets 97 the upper limit of the ratio F(Q)/\Q\ as 
<*(<?) where Q denotes any interval containing x and belonging 
to one of the nets of the sequence 97. By symmetry we define 
similarly the lower derivate of F at x with respect to the sequence of 
nets 97. We shall denote these two derivates by (91) F(x) and (97) i^Gr). 
When they are equal at a point x, their common value will be 
denoted by (97) F {x) and called derivative of F at x with respect 
to the stquence of nets 91. 

These definitions are similar to those given in § 15, Chap. IV, in connection 
with derivation of additive functions of a set ( 33 ) in a metrical space. It should 
be observed, however, that additive functions of a set ( 33 ) correspond to ad- 
ditive functions of an interval of bounded variation, whereas in the present § we 
treat derivation of additive functions of an interval without supposing them 
a priori of bounded variation. For this reason it will be necessary to impose 
certain restrictions on the nets considered in this §, and to distinguish a class 
of nets which we shall call, for brevity, normal nets. The latter are, in point of 
fact, the nets occurring most frequently in applications (cf., for instance. Chap. Ill, 
p. 58). 

A system of intervals will be called a normal net in the space U m , 
when it consists of the closed intervals [(4°, < 4 + 1 ; af\ < 4 + 1 ; ...; a*+\] 

for k — 0, ± 1) which are determined by systems of num- 
bers af subject to the conditions a/p ^ or i = 2, ..., w and 

k — ..., — 1, 0, +1, ..., and lim aW = ±o°. A regular sequence of 

*+ • OO 

normal nets will be termed normal sequence . 

(2.1) Theorem . Let 97 = {974 be a normal sequence of nets , g(x) 
a f unction which is summable in the space M m and F a continuous 
additive function of an interval such that (i) (3l)F(x)> — oo at every 
point x , except at most those of an enumerable set, and (ii) F'(x)^g(x) 
at almost all the points x at which the function F is derivable in the 
ordinary sense. 

Then for every interval l, we have 

(2.2) F(l)^fg(x)dx ; 

/ 

i . e. F is a function of bounded variation , whose function of singul- 
arities is monotone non-negative. 
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Proof. Consider the points in every neighbourhood of which 
there exist intervals 1 for which the inequality (2.2) is false, and let 
P denote the set of these points. The set P is evidently closed, 
and we see easily that the relation (2.2) must hold for every interval 
I such that Z° C CP. For if this were not the case, we could deter- 
mine first an interval IC.GP such that F(I)<J g(x)dx, and then, 

/ 

by the method of successive subdivisions, a descending sequence {I„\ 
of subintervals of I such that d(I n )->0 as n-*oo and that 

F(I„X I g(x)dx for n— 1, 2, .... Therefore, denoting by a the com- 

mon point of the intervals Z„, we should have a e P, which is clearly 
impossible. 

It follows that in order to establish the validity of the in- 
equality (2.2) for all intervals Z, we need only prove that P=0. 
Suppose therefore, if possible, that P ^ 0. Let us denote, for any 
pair of positive integers k and h, by N k , * the sum of all the intervals Z 

OO 

of the net 91* for which F(I)>-h-\I\. Therefore by writing A t *=//.2V*,a, 

k^h 

we obtain a sequence {N/,} of closed sets whose sum, according to 
condition (i), covers the whole space except for an at most enumerable 
set. Consequently, on account of Baire’s Theorem (Chap. II, Theo- 
rem 9.2), the set P contains a portion which either consists of 
a single point, or else is contained in a set K h . The former case is 
excluded since it is evident from the continuity and additivity of 
the function F that the set P contains no isolated points. Therefore 
there exists a positive integer h 0 and an open sphere 8 such that 

Q^=P-8(ZNh„- Let us write H(I)=F{I) + h 0 -\l\ + f\g(x)\dx where I 

i 

is any interval. We shall have H (/) ^ 0 for any interval 1 such 
that Z°CCP, as well as for any interval Z belonging to a net 91* 
of index k^h 0 and having points of the set N ha in its interior. 
Therefore ZZ(Z)^0 for any interval I(ZS belonging to a net 91* 
of index k^h 0 , and consequently, by additivity and continuity of U, 

we have ZZ(Z)^0, i.e. F(I)^-h 0 -\l\— I \g(x)\dx, for any inter- 

i 

val I(Z8 whatsoever. It follows at once that the function F 
is of bounded variation in 8 and that the function of singularities 
of F (cf. Chap. IV, p. 120) is monotone non-negative in S. 
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Hence, by condition (ii), F(I)^ f F'(x)dx^> f g(x)dx for every inter- 

/ / 

val IQS. But since P /S=(=0 we thus arrive at a contradiction and 
this completes the proof. 

As an immediate corollary of Theorem 2.1, we have 

•(2.3) Theorem . If 91 is a normal sequence of nets in the space ll m 
and if F is a continuous additive f miction of an interval such that: 
(i) — oo < (9t)P(a?) ^ (91)F(a?) < + oo for each point x except at most 
the points of an enumerable set , (ii) the ( ordinary ) derivative F\x) is 
summable on each portion of the set of the points at which this deriva- 
tive exists ; then the function F is almost everywhere derivable and is 
the indefinite integral of its derivative. 

For Theorems 2.1 and 2.3 cf. J. Ridder [2], Let us remark that in the 
case where the function F is of bounded variation, these theorems are included 
in Theorem 15.7, Chap. IV, which concerns derivation of additive functions of 
a set in an abstract metrical space. 

It follows easily from Theorem 15.12, Chap. IV, that — 

almost everywhere for any regular sequence of nets of intervals 91 and for any 
additive function of an interval F which is continuous and of bounded variation. 
This remark enables us to replace condition (ii) of Theorem 2.1 by the following: 
(ii -bis) F'(x)~( ( 3l)F'(x)'^g(x) at almost all the points x at which the two derivatives 
F'(x) and (yi)F'(x) exist , are finite and equal. Similarly we may modify condi- 
tion (ii) of Theorem 2.3. 

As it follows from an example due to A. J. Ward [7], the jnequality 
(9 l)F(x) — oo in condition (i) of Theorem 2.1 cannot bereplacedby (9 l)F(x)> — oo. 

§ 8. Major and minor (unctions. Before introducing the 
fundamental definitions of the theory of the Perron integral, we 
shall prove 

(3.1) Theorem. If an additive function of an interval F (not neces- 
sarily continuous ) has a non-negative lower derivate at each 'point x 
of an interval I 0 , then F ( I n ) J? 0. 

Proof. Lett he any positive number and write G(I)—F(I)+e-\I\ 
for every interval I. Then G(. r)^f>0 at each point xel 0 . Suppose 
that <?(/ 0 )^0. By the method of successive subdivisions, we could 
then determine a descending sequence {I,,} of intervals similar to I 0 , 
such that <?(/„)< 0 for n=0, 1,2, ... and that <S(J„)->0 as »-► oo. 
Therefore, denoting by x 0 the common point of the intervals 
we shoulcjL have O(x o )^0 which is impossible. Hence G(I 0 )>0, and 
this gives F(I 0 ) > — c-|/ 0 ! f° r each £ >°> and finally P(I 0 )> 0. 
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An additive function of an interval F is termed major [minor] 
function of a function of a point / on a figure R 0 if, at every point x 
of this figure, — oo^^)^/^) [+oo^=jP s (®)^/(®)]. It follows at 
once from Theorem 3.1 that if the functions of an interval U and V 
are respectively a major and a minor function of a function / on 
a figure R p , their difference V —V is monotone non-negative on R 0 . 

(3.2) Theorem . If f is a summable function , then, for each * > 0, Me 
function f has an absolutely continuous major function f, J7 and an 
absolutely continuous minor function V such that , for each interval I, 

(3.3) 0^17 (I) — lf(x)dx^e and 0^ff(x)dx — F(!)^e. 

/ / 

Proof. On account of the theorem of Vitali-Carath^odory 
(Chap. Ill, Theorem 7.6) we c&n associate with the function / two 
summable functions, one a lower semi - continuous function g 
and the other an upper semi-continuous function h , such that 

(i) — g(x)^f(x)^h(x) 4= + °° at every point x and that 

(ii) f[g(x) — f(x)]dx<e and f[f(x) — h(x)]dx<.e for every interval!. 
/ / 

Therefore, if we denote by U and V the indefinite integrals of the 
functions g and h respectively, we find by Theorem 2.2, Chap. IV, 
that U s (x)^ g(x)^h(x)^V*(x), and so, on account of (i), that 
— 004 z lh(x)^f(x) and +oo=^ F s (a?X/(a?) at each point x. Finally, 
(ii) then implies the relations (3.3) and this completes the proof. 

Theorem 3.2 can easily be inverted. Thus: in order that a function of a point f 
be summable , it is necessary and sufficient that for each e>() there exist two absolutely 
continuous functions of an interval XI and F, the one a major and the other a minor 
function of /, which fulfil the condition XI (I) — V (I) < e for every interval I. (These 
absolutely continuous functions may clearly be replaced by functions of bounded 
variation, and if the function / is supposed measurable, then, of course, for its 
summability there suffices the existence of two functions of bounded variation, 
one of which is a major and the other a minor function of /.) 

* § 4. Derivation with respect to binary sequences of 
nets. The theorems of § 2 concerned derivation of additive functions 
with respect to any normal sequence of nets of intervals. For certain 
purposes however, more special sequences of nets are required. 
We shall say that a normal sequence of nets in the space 

i? m is binary, if the net 9t* +) (where k— 1, 2, ...) is obtained by 
subdividing each interval N of the net 91* into 2"' equal intervals 
similar to N. 
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An application of this notion may be found in the following theorem which 
is proved similarly to Lemma 11.8 of Chap. IV: If 91 is a binary sequence of nets , 
any additive function of an interval F is derivable with respect to 91 at almost all 
the points at which either (9 l)F(x )> — oo or (yi)F(x)<.+oo. 

Another application, of particular interest, is due to A. S. Besicovitch [3] 
who, by using derivation with respect to a binary sequence of nets, established 
a theorem on complex functions ( vide below § 5). The substance of Besicovitch’s 
result is contained in Theorem 4.4 below. We must first, however, give some 
subsidiary definitions. 

For definiteness, just as in § 11, Chap. IV, we shall fix in the 
space R m a binary sequence of nets Q={Q*}, where Q* denotes, for 
4—1,2,..., the net formed by the cubes 

[ Pl /2*, (p { + l)/2 h ; pj2 h , (p 2 + l)/2*; ...; pJ2 k , (p m + l)/2 k ] 

where p v p v ..., p m are arbitrary integers; it goes without saying 
that in Theorem 4.4 this sequence may be replaced by any binary 
sequence whatsoever. 

Given a non-negative number a, we shall say that a function 
of an interval F fulfils the condition (1« ) [condition (1« )] at a point#, 
if lim inf F(I)j[d(I)] a ^ 0 [lim sup F(I)/[d(I)]“^ 0], where 7 is any 

6(1 0 <5(/)-M) 

interval containing x. If a function / fulfils the condition (1« ) [(17)] 
at every point of a figure 12, we shall say simply that / fulfils this 
condition on R . Finally, we shall say that a function fulfils the 
condition (1«) at a point, or on a figure, if it fulfils simultaneously 
the conditions (1« ) and (17). 

We recall further the notation A a (E) for the a-dimensional 
measure of a set E (cf. Chap. II, p. 53). 

(4.1) Lemma . Given a set E in the space R mj together with a positive 
integer 4 0 and a non-negative number a< m, there exists for each £>0 
a sequence of intervals belonging to the nets Q* for k^k 0 , which 
fulfils the folloiving conditions : 

(i) £ [<5«wr < (4 rh) m -[Aa(E) +£]; 

n 

(n) to each point x of E there corresponds a positive integer k^k 0 
such that all the intervals of the net Q* which contain the point x belong 
to the sequence 

Proof. Let us cover E by a sequence {25/}/=, 1,2.... of sets such 
that 0<d(Ei) < for i= 1,2,... and such that 

(4.2) Z[6(Ei)] a ^A u (E) + s. 
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Let us denote by k i} for each t=l,2, a positive integer 
such that 

(4.3) 1/2*' > <?(£,) ^?l/2*' +1 . 

We easily see that /£/>&o for every i, and that each net CL,, 
for i=l,2, ..., can contain at most 2 m intervals having points in 
common with E,. Let 2 be the sequence of all the intervals 

belonging to the nets CL,, Q* 2 , ..., CL,, ... and having points in com- 
mon with the sets respectively. The sequence {Q n } 

clearly fulfils the condition (ii). Moreover, we find on account of (4.3), 

Z[d(Q n )]“ <2 m -Zm a -2- ak i^2 m 7n“2 a .£2- a(k ^ l) ^(imr 

n i / i 

and this by (4.2) gives at once the condition (i). 

(4.4) Theorem . Suppose that F is a continuous additive junction of 
an interval in the space R m and fulfils the condition (1«) where O^a < m, 
and that g is a summable function. Suppose further that (i) (Q )F(x) > — oo 
at every point x except at most those of a set E expressible as the sum 
of an enumerable infinity of sets of finite measure (d«), and that 
(ii) (Q )F(x)^ g(x) at almost all points x\ then 

(4.5) F{I 0 )^ I g(x)dx 

4 

for every interval 7 0 . 

Proof. Since the function F is continuous, it will suffice to 
prove (4.5) in the case in which the interval I 0 belongs to one of 
the nets CL, to the net CL„, say. Further by changing, if necessary, 
the values of g on a set of measure zero, we can assume that 
the inequality (Q)F(x) ^g(x) holds at every point x. 

Let e be a positive number and let V be a minor function of g 
(cf. § 3, particularly Theorem 3.2) such that 

(4.6) V(I 0 )> I g(x)dx—t. 

4 

Let us write G(I)=F(I)-V(I)+e-\I\, where / denotes any interval. 
We shall have (Q)G(x)^(Q)F(x) — V {x)-\-e~^-t >0 at every point 
x except at most at the points of E. Finally, since V s (x) <-f oo at 
every point x, the function V fulfils the condition (IT) and the func- 
tion G therefore fulfils the condition (1/ ). 
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Let us now represent the set E as the sum of a sequence 
{Et}i= 1,2,... of sets of finite measure {A a ), and denote, for each pair of 
positive integers i and n, by R, <n the set of the points x such that the 
inequality (7(1) > — f-[<5(/)]“/2 / [A„{Et)-\- 1] holds whenever 1 is an 
interval containing x and belonging to one of the nets Q* for k'^?n. 
The sets R lt „ are evidently measurable (D) (they are actually sets (©<>)). 
Moreover, since the function 0 fulfils the condition (1^), - the sum 
2JRi,„ must, for each integer i, cover the whole space R m . Hence, 

n 

writing Ei tn =E r (Ri, n — Ri,n- 1 ) for «>1, and E iA = E r R ltU we 
find that 

(4.7) A«(E,) = £A a (E l<n ' for i= 1 , 2 ,.... 

n 

This being so, it follows from Lemma 4.1 that for each pair 
of positive integers i and n, we can determine a sequence {Q ( /^! y=1 2 
of cubes which belong to the nets Q* for k^n, and fulfil the fol- 
lowing conditions: 

(4.8) Zmyi)T<(*™) m -[A a (E ln ) + ll2 n ]-, 

(4.9) to each point xeE itn there corresponds an integer k^k^ such 
that each cube of the net Q*, containing x , belongs to the se - 
quence {Q ( fy J=lx 

(4.10) each cube Q\ j) n has points in common with the set E j n and therefore 
fulfils the inequality G(Q { { J) n ) > — e’[6(Q) J) n )f l2 l -[A a (E ( ) + l]. 

For brevity, let us agree to say that an interval has the prop- 
erty {A), when it is representable as the sum of a finite number 
of non-overlapping intervals I each of which either fulfils the in- 
equality (?(/)> 0, or else coincides with one of the cubes Q\ j) n . We 
remark that on account of (4.10), (4.8) and (4.7), the inequality 

0(R)>- e-Z WQ^TVUjE') +1]> 

t,n,J 

^-(4m) m e-Z[Aa(E,, n ) + l/2"]/2'[/l a (E t ) + 1] = - (4 mf • e 

/, n 

is valid whenever R is a figure consisting of any finite number 
of non-overlapping cubes Q\ J \ and therefore that the inequality 
0(1)^— (4w) m -f must hold for every interval I having the prop- 
erty (.4), 
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We shall now show that the interval I 0 itself has the prop* 
erty (A), so that G(I 0 — (4m) m e. Let us suppose the contrary. 
We could then, starting with I 0 , construct a decreasing sequence 
{I p } of cubes belonging to the nets Q * and none of which has the 
property (A). Let x 0 be the common point of these cubes. Then 
either x 0 eE, and consequently, by (4.9), the sequence contains 
cubes or x 0 e GE, so that (Q)G(x 0 ) >0, and therefore 0(1,,) > 0 
for each sufficiently large p. Thus in both cases, the sequence { I p } 
would contain intervals with the property (A) and we arrive at 
a contradiction. It follows that G(I 0 ) ^ — (4m) m f, and therefore, 
by (4.6), that 

F(I 0 )=G(I 0 )+V(I 0 )-e-\I 0 \> fg(x)dx-[l+(4m) m +\I 0 \]t-, 

h 

since e is an arbitrary positive number, this gives the relation (4.5). 

#§5. Applications to functions of a complex variable. 

We now interpret the points of the plane i? 2 as complex numbers 
and, as usual, we call complex function of a complex variable every 
function of the form u + iv where u and v are real functions in 
the whole plane, or in an open set. The functions u and v are 
termed real part and imaginary part of the function /. A complex 
function is said to be continuous (at a point, or in an open set), if 
its real and imaginary parts are both continuous. 

Given a complex function /, continuous m an open set 6r, and 
having the real and imaginary parts u and v respectively, we shall 
write for every interval / = [«i, b x \ a 2 , b 2 ] contained in G: 

b x b 

J i(/; I) = — I [u(x, b 2 ) — u(x, « 2 )] dx— I [«(&!, y) — v(a v y)] dy, 

a, a, 

< 5 - 1 ' », 

J 2 (/; /)= / [m(6|, y)~ M(« lt y)]dy — f\ [v lx,b 3 ) — v{x, a,)] dx, 

a, a x 

and 

The expression J (/;/), which will also be denoted by ffdz, will 

(/) 

be called curvilinear integral of the function / along the boundary 
of the interval I. The function / will be termed holomorphia in an 
open set G, if J(/;/) = 0 for every interval IQG. (The equivalence 
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of this definition of the term „holomorphic“ — used here in place 
of terms such as “regular”, “analytic”, etc. — with the more familiar 
definitions of the theory of complex functions, follows from the 
well-known theorem of Morera [1].) We verify at once that this 
relation holds when f(z)=az+b where a and b are any complex 
constants. 

If / is a complex function, continuous in an open set G, the 
expressions Jj(/; I) and J 2 (f; I) are continuous additive functions 
of the interval 1 in 0. Moreover 

|J,(/;/)|<|J(/;/)| and | J 2 (/; /)|< | J(/; 7)| 


for each interval I in G. On account of Theorem 2.3, we there- 
fore obtain at once the following theorem due to J. Wolff [1] 
(cf. also H. Looman [2] and J. Ridder [1; 2]): 

(5.2) Theorem. A complex function /, continuous in an open set G, 
is holomorphic in 0 if at almost all points z of G, 

liminf-ri-l f f(z) dz 1 = 0, 

<5«J)-*> V \J I 

(Q) 


and if at all points z of G, except at most those of an enumerable set, 


lim 8up -rprr f f(z)dz 
«?> 


< + °o, 


where Q denotes any square containing z. 


A complex function is called derivable at a point z 0 , if the ratio 
[f(z) — f(zo)]l( z — z o) tends to a finite limit when z tends to z 0 in any 
manner. This limit is called derivative of / at z 0 and is denoted by f (z 0 ). 

Let / be any complex function, defined in the neighbourhood 
of a point z 0 . If we have lim sup \[f{z 0 + h)— f(z 0 )]/h\ < + oo, we 

h -*0 

can write f(z)—f(z 0 )-\-M(z)-(z—z 0 ), where M(z) is a function of z 
which is bounded in the neighbourhood of z 0 ; and we then easily 
find that the ratio \J(f]Q)\l\Q\, and a fortiori the ratios | J 1 (/;<2)|/|Q| 
and | J 2 (/; Q)\l\Q\i must remain bounded when Q denotes any suf- 
ficiently small square containing z 0 . If, further, the function / is 
derivable at z 0 , we have /(z)=/(z 0 )+ /'(z 0 Mz— ^ «<>) + * ( z )-( z — z o)> 
where |e(z)|->0 as z—rz 0 , and the ratios in question tend to zero as 
S(Q)-rO. Finally, let us observe that if the function / is continuous, 
the expressions J t (f; I) and J a (/; I), considered as functions of the 
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interval I, both fulfil the condition (1J of § 4. Therefore, if we apply 
Theorem 4.4, we obtain the following theorem due to A. 8. Besi- 
covitch [3] (cf. also 8. 8aks and A. Zygmund [2]): 

(5.3) Theorem . A complex junction /, continuous in an open set Gy 
is holomorphic in G if it is derivable at almost all the points of G and if 
further lim sup \[f(z-\-h) — f(z)]/h\ < + oo at each point z of G except at 

most those of a set which is the sum of a sequence of sets of finite length . 

The theorem of Besicoviteh may be regarded as a generalization of the 
classical theorem of E. Goursat ( 1 ] : A complex junction /, continuous in an open 
set ( r , is holomorphic in G if it is everywhere derivable in G. T. Pompeii! [1] showed 
that it is enough to suppose / derivable almost everywhere, provided that 

we restrict the expression lim sup \[f{z-\-h) — f(z)]/h to be bounded in G. Finally, 

/!—►() 

H. Loomari [3] (cf. also J. Ridder [2]) replaced the condition that the expres- 
sion lim sup f f(z+h) — f(z)]/h\ is bounded by the condition that this expres- 
/?-M) 

sion is finite at each point of G. Theorem 5.3 evidently includes all these 
generalizations. 

The theorems of Morera and of Goursat, and their generali- 
zations furnished by Theorems 5.2 and 5.3, contain criteria for 
holomorphism which are based on the notion of curvilinear integral 
and of derivation in the complex domain. The classical theorem of 
Cauchy is an instance of a criterion of a different kind, expressible 
in terms of real variable conditions on the real and imaginary parts 
of a complex function; we have in fact, according to this theorem: 
in order that a continuous function of a complex vari- 
able f(z)=u(Xjy) + iv(x,y) be holomorphie 'in an open 
set Gy it is necessary and sufficient that the partial 
derivatives u' x , u y1 v x , v y should all exist in G and be 
continuous, and that they everywhere fulfil the Cauchy- 
Biemann equations u x =v, and u y =—v x . 

A series of researches begun by P. Montel [1] has been de- 
voted to the reduction of these conditions, particularly that of the 
continuity of the partial derivatives. The problem was finally solved 
by H. Looman [2] and D. Menchoff (vide the first ed. of this book, 
p. 243, and D. Menchoff [I]) who succeeded in removing completely 
the condition in question without replacing it by any other. It is 
remarkable that a classical problem of such an elementary aspect 
should only have been solved by a quite essential use of methods 
of the theory of real functions. 
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(5.4) Lemma . Let w be a real function of one variable , derivable 
almost everywhere in an interval [a, 6]; let F be a closed non-empty 
subset of this interval , and let N be a finite constant such that 

| w (x 2 ) — w (ajJ 1* 2 — I 

whenever x t eF and x 2 e [a, b ]. Then 

(5.5) | w(b) — w(a ) — f «/(a?)<fo|<i •$"•(& — a — |.F|). 

Proof. Let us denote by F t the set obtained by adding the 
points a and b to the set F. The function w, equal to w on F x and 
linear on the intervals contiguous to F v is evidently absolutely 
continuous on [a, b] (and even fulfils the Lipschitz condition). Hence 

b 

(5.6) w(b)—w(a)= w(b) — w(a) = f w(x)dx. 

a 

Now w'(x) = w'(x) at almost all the points x of F and | £?'(#) | <1 N 
at each point x outside F. The relation (5.5) therefore follows at 
once from (5.6). 

(5.7) Lemma . Let w(x,y) be a real function whose partial derivatives 
with respect to the two variables x and y exist at every point of 
a square except at most at the points of an enumerable set ; and 
let F be a closed non-empty subset of Q, and N a finite constant 
such that 

k (® 2 > Vi)— w> (•»« 3/i)l < ®il and | w ( x v y 2 ) —w (x u y 2 )| < N- |y 2 — y x \ 

whenever (x^yJeF, {x 2 , y^eQ, and (x v y 2 )eQ. 

Then if [c^, b x ] a 2 ,b 2 ] denotes the smallest interval (which may 
be degenerate) containing F, we have 
t>* 

| f [w(x, b 2 ) — w(x, a 2 )]dx — J f w y (x, y)d#dy|<5iV-|Q — F\ 

(5.8) \ * 

|/ [»(*!* V)— «>(%, y)]dy—f fw' x (x, y)dxdy^5N-\Q — F|. 

a t F 

Proof. Let us choose arbitrarily two points (x', a 2 ) and (x", b 2 ) 
belonging to the set F and situated on the two sides of the interval 
[<*i, brf a 2 , b 2 ] parallel to the rc-axis. For any point £ of fo, ftj] we 
have \w(£,b 2 )—w(£, <* 2 )Kk(£, b 2 )—w(x", 6 2 )| + |w(®", b 2 )—w(x', 6 2 )| + 
+ \w (x\ b 2 ) —w(x\ a 2 )\ + \w (*', a 2 ) -w(£, a 2 )| ; and hence, denoting by l 
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the length of the side of the square Q, we obtain 

Ms,**)— 

{ ' ^N-[\x"-£\ + \x'~x"\ + \a t -b,\ + \g-x'\]^4m. 

We now denote for any point £ of [a ly by F$ the set of 
all the points y of [a 2 , b 2 ] such that (^ y y)eF. Let A be the set of 
the points £ of the interval [a ly b x ] for each of which F^O, and 
let B denote the set of the remaining points of [a xi b x ]. On account 
of Lemma 5.4 we have 

w(£, bi) — «?(£, ai) — f w y (i, y)dy\*CN-(b 2 —a‘i — \F$\) 

whenever ?ei, and if we integrate the two sides of this inequality 
with respect to £ on the set A , we find 

I f [w(£, b 2 )—w(£, a 2 )] d£ — f [*>'„(& y)dyd£ 

(5.10) 'V 1 


On the other hand if we integrate (5.9) with respect to £ on the 
set B , we obtain ||[w(£,6 2 ) — w(£, a 2 )]d£|^4^i-|^|^4JV r - \Q — F\ } 

B 

and by adding this to (5.10) we obtain the first of the inequalities (5.8). 
The second inequality follows by symmetry. 

(5.11) Theorem of Looman-Menchoff. If the functions u(x,y) 
and v(x , y), continuous in an open set 0 , are derivable with respect 
to x and with respect to y at each point of G except at most at the points 
of an enumerable sret , and if u x (x,y)=Vy(x,y) and u y (x, y)= — v x (x 7 y) 
at almost all the points ( x , y) of G } then the complex function f=u + iv 
is holomorphic in G . 

Proof. Let us denote by F the set of the points (x, y) of G 
such that the function / is not holomorphic in any neighbourhood 
of ( x , y). The set F is evidently closed in G and the function / is 
holomorphic in G—F. It thus has to be proved that F is empty. 

Suppose therefore, if possible, that F^O and let F n denote, for 
each positive integer n, the set of the points (#, y) of G such that, 
whenever |ft|^l/n, none of the four differences u(x + h, y)~ u(x, y) 
u(x,y+h)— u(x,y), v(x-{'h J y)—v(x J y), v(x,y + h)—v(x,y) exceeds \nh\ 
in absolute value. By continuity of the functions u and v, each 
of the sets F„ is closed in G . On the other hand, the sets F n cover 
the whole set G , except at most an enumerable set consisting of 
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the points at which the functions u and v are not both derivable 
with respect to x and with respect to y simultaneously. Therefore, 
on account of Baire’s Theorem (Chap. II, Theorem 9.1), the set F(Z O 
contains a portion which either reduces to a single point, or else is 
contained entirely in one of the F n . The former possibility is ruled out, 
since, as we easily see on account of the continuity of /, the set F 
cannot contain any isolated points. There must therefore exist a 
positive integer N and an open sphere 8 such that Q^=F -S(ZF N . 

Let Q be any square contained in S. such that d (Q)^l/N 
and We denote by I = [a v b L ; a 2J b 2 ] the smallest interval con- 

taining Q-F. By applying the evaluations of Lemma 5.7 to the 
integrals on the left-hand sides of the formulae (5.1) and by taking 
into account the relations u x (x 1 y)—Vy(x J y) and ^y(x J y)~—v , x {x,y) 
which are, by hypothesis, fulfilled almost everywhere, we find 
|J 1 (/;I)j^lOA/ r -|^— F| and | J 2 (/; 7)| < 10iV-|$ — F\, and therefore 
| J (/; 7)|^20A7 *|^— jP|. This last inequality may also be written 
j J (/; F|, since the figure QQI contains no points 

of F in its interior, and since therefore J(/; J£)~0 for each interval 
R contained in QQI. 

Now let z 0 —(x 0 ,y 0 ) be any point of 8 , and let Q be any square 
containing z 0 . By what has just been shown, if z 0 eF we have 
|J(/; Q)\I\Q\K20N-\Q — F\I\Q\ as soon as <HQ)^ 1/N; the ratio 
J(/; Q)/\Q\ therefore remains bounded as <S($)->0 and tends to zero 
whenever z 0 is a point of density of F. Further | J(/; Q)\l\Q\~>0 as 
(J(#)->0, whenever z 0 eS—F , since J(f- } Q)=Q for every square Q 
which does not contain points of F. Therefore by Theorem 2.3, 
the function / must be holomorphic in 8. This is, however, excluded 
since S-F 4=0. We thus arrive at a contradiction and this completes 
the proof. 

Theorem 5.11 was stated (even in a more general form) by P. Mon tel [2] 
as early as 1913, but without proof. The proof supplied by H. Looman [2] in 1923 
was found to contain a serious gap which was only finally filled in by D. Men- 
choff (cf. D. Menchoff [I] and the first edition of this book, p. 243). 

By making use of general theorems on denvates (vide, below, Chap. IX) 
it is possible to weaken slightly the hypotheses of the theorem. Thus instead 
of assuming partial derivabiiity of the function u and v y it is sufficient to sup- 
pose that at each point of 0 (except at most those of an enumerable 
set) these functions have with respect to each variable, x and y, 
their partial Dini derivates finite. This condition implies (cf. Chap. VII, 

§ 10. p. 236, or Chap. IX, §4) partial derivabiiity of the functions u and v with 
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respect to each variable at almost all points of G (this generalization of the 
theorem of Looman-Menchoff does not require any alteration of the proof; 
for other and much deeper generalizations, vide the memoirs of D. Menehoff 
[1: 2]). 

The extension of Theorem 5.11 which we have indicated, includes in par- 
ticular the theorem of Looiuan mentioned above, p. 197, but not however the 
theorem of Besicovitch (5.3). It would be interesting to establish a theorem which 
would include both the theorem of Besicovitch and that of Looman-Menchoff. 


§ 6. The Perron integral. For functions of one real vari- 
able, as announced in § 1, the method of major and minor func- 
tions lends to an important generalization of the Lebesgue integral. 

A function of a real variable, /, is termed integrable in the sense 
of Perron , or integrable, on a figure R 0 in It v if 1° / has both major 
and minor functions on /? 0 , and if 2° the lower bound of the numbers 
U(K 0 ), where V is any major function of / on R 0 , and the upper 
bound of the numbers V(R 0 ), where V is any minor function of /, 
are equal. The common value of the. two bounds is then called 
definite Perron integral , or definite integral , of / on /f 0 , and de- 
noted by (c?) j f{x)dx. It is evident that for cf'-integrability of afunc- 

R, t 

lion f on a figure R 0 it is necessary and sufficient that for each t >0 
there should exist a major function U and a minor function V of f on R 0 
such that U(R 0 ) — V(R 0 )<f. 

Since (of. § 3, p. 190) the function U — V is monotone non- 
decreasing for every major function U and every minor function V 
of /, it follows that every function which is <P - integrable on a figure R 0 , 
is so also on every figure R(2 R (y The function of an interval 

P(I) = (&') j f(x)dx, thus defined for every interval 1(2 R Q , ™ called 
/ 

indefinite Perron integral , or indefinite cT- integral , of / on /? 0 . As we 
see at once, P(I) is an additive function of the interval I. Moreover, 
given any positive number f , there exist always a major function V 
and a minor function V of /, such that 0^ U(I) — P(I)^£ and 
0^P(/)— V(I)^e for every interval I C RqI and since U(x)>— oo 
and + at each point x of R 0 , it follows at once that the 

function P is continuous. Just as in the case of the Lebesgue integral, 
a function of a real variable is termed iitde finite integral [major 
function , minor function ] of a function /, if this is the case for the 
function of an interval determined by it (cf. Chap. Ill, § 13). 
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As we see at once from Theorem 3.2, every function which 
is integrable in the sense of Lebesgue on a figure B 0 , is so in the sense 
of Perron, and its definite Lebesgue and Perron integrals over R 0 are 
equal. On the other hand, if F is the primitive of Newton (cf. § 1) 
of a function /, the function F is at the same time a major and 
a minor function of /, and therefore is the indefinite ^-integral of /. 
It follows that Perron's process of integration includes both that of 
Lebesgue and that of Newton. 

We shall establish some fundamental properties of the Perron 
integral. 

(6.1) Theorem. Every F-integrable function is measurable, and is almost 
everywhere finite and equal to the derivative of its indefinite integral. 

Proof. Let / be a function of a real variable, <F - integrable 
on an interval I 0 , and let P be its indefinite ^-integral on I 0 . It has 
to be proved that the function P has at almost all points x, a finite 
derivative equal to f(x). 

For this purpose, let e be any positive number and U a major 
function of / such that 

(6.2) U (/„) — P(l 0 ) < « 2 . 

Let us write H=U — P. The function H, as monotone non-decreasing, 
is almost everywhere derivable, and if we denote by E the set of 
the points x of I 0 at which H’(x) ^ e, we find, by (6.2) and Theo- 
rem 7.4, Chap. IV, that |F?| < t. 

Now at each point xel 0 where the function H is derivable, 
U(x)=H'(x)+P(x); hence P(x)> — oo and P(x)^U(x) — e^f(x) — e 
at almost all the points x of I 0 — E. Therefore, since |F?|=^r, t being 
an arbitrary positive number, it follows that — oo ^P(x)^ f(x) at 
almost all the points x of I 0 . By symmetry this gives also 
-fo o P (x) f (x), and finally oo^zP'(x)—f(x) almost everywhere 
in I 0 . 

(6.3) Theorem. If two functions f and g are almost everywhere equal 
on a figure R 0 and one of them is F-integrable on R 0 , so is the other 
and the definite F-integrals of f and g over R 0 are equal. 

Proof. Suppose that the function / is ^-integrable and denote 
by A the value of its definite integral over R 0 . Let e be any positive 
number and let V and V be two functions of an interval, which 
are respectively a major and a minor function of / on R 0 and which 
fulfil the inequalities 
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(6.4) U(R 0 ) ^A^V(R 0 ) and U(R 0 ) — V(R 0 ) ^ e/3. 

Let us denote by E the set of the points x at which f(x)^g(x). 
The function equal to +oo at all the points of E and to 0 every- 
where else is therefore almost everywhere zero, and by Theorem 3.2 
has a major function 0 such that O^G(R 0 )^.e/3. We have G{ x) = -f oo 
at each point xeE and writing U^U+G, V 1 ~V — G, we see that 
the functions of an interval U 1 and V x thus defined are respectively 
a major and a minor function of the function g on R 0 . Moreover 
by (6.4), U^R^A^V^R,) and U 1 (R 0 )~V l (R 0 )^e. Therefore 

the function g is c/'-integrable on R 0 and A — (t?) f g{x)dx, which 
completes the proof. 

(6.5) Theorem . Every function f which is &-integrable and almost 
everywhere non-negative on a figure R 0 , is summable on this figure. 

Proof. We may assume, by Theorem 6.3, that the function / 
is everywhere non-negative on R 0 . Therefore if V is any major 
function of /, we have V (x)^ f(x)^0 at every point xeR 0J and 
consequently, by Theorem 3.1, the function V is monotone non- 
decreasing. Its derivative U'(x) is therefore summable on i? 0 , and, 
since U\x)^ f(x)^0 almost everywhere, the function / is also sum- 
mable on R 0 . 

Theorem 6.5 shows that, although Perron integration is more general 
than Lebesgue integration, the two processes are completely equivalent in the 
case of integration of functions of constant sign. 

§ 7. Derivates of functions of a real variable. Certain 
of the theorems of §§ 2 and 3 can be given a more complete statement 
when we deal with functions of one real variable. We shall begin 
with the following proposition which is due to Zygmund: 

(7.1) Theorem. If F(x) is a finite function of a real variable such 
that (i) limsup^(#— h)^F(x)^limsupF(x + h) at every point x, and 

/i— >0-|- — >-0— j— 

(ii) the set of the values assumed by F(x) at the points x where F + (x)^.0 
contains no non-degenerate interval, then the function F is monotone 
non-deei reusing. 

Proof. Suppose, if possible, that there exist two points a 
and b such that a <6 and that F(b)< F(a). Then, denoting by E 
the set of the points x at which F + (x)^Q, we can determine a value y 0 
not belonging to the set F[E] and such that F(b)<.y 0 <.F(a). Let 
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Ju be the upper bound of the points a; of [a, 6], for which F(x)^y 0 . 
We shall obviously have a<x 0 <^b, F(x 0 ) = y 0 , and F(x)^y 0 for 
each point x of the interval [>„, b}. Therefore F + [x lj )^.0, although .r 0 
does not belong to E. This is in contradiction with the definition 
of the set E. 


Let us mention the following consequence of Theorem 7.1: 
l>ini’s Theorem, Given on an interval I=[a, 6] a continuous function F[x) % 
the upper and lower bounds of each of its four Dini derivates are respectively equal 


*V 2 ) 


where x } and x 2 are any 


to the upper and lower bounds of the ratio 
points of I. "2 * c i 

Let, for instance, m be the lower bound of the derivate F+(x) on the inter- 
val /, and suppose first that m > — oo. Then, if m' denotes any finite number 
less than m, the function F(x) — m'*x has everywhere on [ a , b\ its upper right- 
hand derivate positive; and so by Theorem 7.1. F(x 2 )—F(x 1 ) :m'-(x 2 — x^, 

and therefore also [F{ x 2 ) — F(x l )]/(x 2 — x k ) ' tn, for every pair of points j, and x 2 
of 1 such that x y <x 2 . Since the inequality just obtained is trivial in the case 
m ~ oo, the theorem follows. 

An immediate consequence is the following theorem: 


If any one of the four Dini dedicates of a continuous function is continuous 
at a point , so are the three others , and all four derivates in question are equal , so 
that the function considered is derivable at this point . 

These two propositions were proved by IT. Dini [I] in 1878. 


(7.2) Theorem. If H is a finite function of one variable such that 

(i) lim sup H(x — h) ^ H(x) ^ lim sup H(x -f h) at every point x y and 

(ii) H+(x)^Q at every point x except at most at those of an enumer- 
able set, then the function II is monotone non-decreasing . 

Proof. Let f be a positive number and write F(x)~H(x) + ex. 
We have 0 at each point x except at most at those of 

a finite or enumerable set E . The set F[E ] being, with E , at most 
enumerable, it follows from Theorem 7.1 that the function 
F(x)~H(x) + fx is non-decreasing for each e>0; and by making 
f->0 we obtain the assertion of the theorem. 


(7.3) Theorem . Suppose that F is a continuous function and g 
a SP-integrable function of a real variable , and that , further , we 
have (i) F+(x) ^ g(x) at almost all points x and (ii) jr*~(a&)> — oo 
at every point x , except at most at those of an enumerable set ; then 

b 

(7.4) F(b)—F(a) > (F) f g(x)dx 

a 

for every pair of points a and b such that a <6. 
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//, in addition , ( i x ) F+ (x)~J^ g(x)^F*~ (x) at almost all points x 
and (iii) F*(x) > — 00 and F^(x) <-f 00 at every point x except at most 
at those of an enumerable set , then the function F is an indefinite 
F-integral of g. 

Proof. We may obviously assume that (x)^g(x) at every 
point x. Therefore, denoting by V any minor function of <7, and 
writing H—F — F, we shall have H+(x)^F^(x)— V+(x)7^0 at every 
point Xj except at most at those of a finite or enumerable set where 
F+(x)== — 00 . Further, since the function F is continuous, the 
inequality V(x)<+ 00 , which holds at every point x, implies that the 
function H satisfies the condition (i) of Theorem 7.2. Consequently, 
by Theorem 7.2, H(b)—H(a)^ 0, i.e. F(b)-~-F(a)^V (b)—V (a), and 
since F is any minor function of g, we obtain the inequality (7.4). 

The second part of the assertion is an immediate consequence 
of the first part. 


As we easily see, the condition of continuity of the function V in the 
first part of Theorem 7.3 may be replaced by the condition (1) of Theorem 7.1. 

Theorem 7.3 constitutes, on account of Theorem 7.4, Chap. IV, a general- 
ization of the following theorem of Lebesgue fl, p. 122; 2; 3; 4; II, p. 183]: 
in order that one of the derivates of a continuous function , supposed finite, be sum- 
mable , it is necessary and sufficient thai this function be of bounded variation; its 
absolute variation is the integral of the absolute value of the deri/vate in question . 
Let us add that in the case in which the function F is assumed to be of bounded 
variation, Theorem 7.3 is included in Theorem 9.6 of Chap. IV. 

The condition (ii) of the first part of Theorem 7.3, as well as the con- 
dition (ii,) of the second, is quite essential for the validity of the theorem. It is 

possible, in fact, to give an example of a continuous function whose derivative 
exists everywhere and is summable, without the function being the indefinite 
integral of its derivative, and this because the latter assumes infinite values. 
To see this, we shall first show that given any closed set E of measure zero in 
an interval J 0 =[a, b], there exists a function G , absolutely continuous and in- 
creasing in J 0 , which has a derivative everywhere in J 0 and fulfils the conditions 

(7.5) G'(x )~- foo for xeE and 6f'(:r) ^ -foo for X€J 0 ~E* 

Let us suppose for simplicity that E contains the end-points a and b 

of J 0 and let us denote by {[a nt bn]} the sequence of the intervals contiguous 

to E. Let {/i n } be a sequence of positive numbers such that 

00 

(7.6) Umh„/(h„—, a„)=+oo and (7.7) = l 

n /«=-! 

oo 


(it suffices to write, for instance, b n ~\r n —\r n ^ , 


where r n =* 


_L_ y 

b — a 


(bi-ai)). 


Let us write • 


(7.8) 



h n /(x—an) 12 (6/1— as) 12 , 
-boo. 


when a n <T<b n , 
when xeE. 
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Thus defined the function g(x) is non-negative on J 0 and summable on J 0 , 

b 

since f g(x)dx—nhn, so that by (7.7) we have jg(x)dx=7 1. Let 0 be the indefi- 

a„ a 

nite integral of g on J 0 . In order to verify that the function G fulfils the con- 
ditions (7.5), we observe that the function g(x) is continuous for every xeJ 0 — E ; 
on the other hand, if we denote by m n the lower bound of g(x) in fo n , b n ] we derive 
from (7.6) that lim m rt =lim 2h n /(b n — a n )=+oo, from which it follows that 

n n 

Jim g(x)~ 4 -oo=< 7 (x 0 ) for every x 0 eE. Consequently G / (x)—g(x) for every x , 

x-+x 0 

and therefore, by (7.8), the conditions (7.5) hold. 

Now let (cf. Chap. ITI, (13.4)), H(x) be a continuous non -decreasing sin- 
gular function on t/ 0 , which is constant on each interval contiguous to the set E , 
and such that H(a)^H (b). Let us put F=G+H. As we verify easily from (7.5), 
we have F'(x)—G'(x) = g(x) at every point x of J 0 . The function F therefore 
has everywhere a derivative which is summable on J 0 . But, since H is the 
function of singularities of the function F , the latter is certainly not absolutely 
continuous, let alone the indefinite integral of its derivative. (The functions 
G and F provide at the same time an example of two functions whose deriv- 
atives, finite or infinite, exist and are everywhere equal, without the difference 
G — being a constant; cf. H. Hahn [1] and S. Ruziewicz [1].) 

In connection with these examples, it may be interesting to mention the 
following theorem ( vide G. Goldowsky [1] and L. Tonelli [8]): 

(7.9) Theorem. If a continuous function F has a (finite or infinite) derivative 
at each point of except perhaps at the points of an enumerable set, and if this 
derivative is almost everywhere non-negative , the function F is monotone non-de- 
creasing . 

Proof. Let E be the set of the points x such that the function F is not 
monotone in any neighbourhood of x. The set E is evidently closed, and the 
function F is non -decreasing on every interval contained in C E. It therefore 
has to be proved that the set E is empty. 

Suppose, if possible, that E (0, and denote for every positive integer n 
by P n the set of the points a; for which the inequality 0<x —x<l/n implies 
F(x') — F(x)-^—(x' — x) however we choose x'. Similarly let Q n be the set of the 
points x for which the same inequality implies F(x‘) — F(x)^—2(x' — x). We 
see easily that the sets P n and Q n are closed, and that they cover the whole straight 
line JR, except at most the finite or enumerable set of the points at which the 
function F is without a derivative. Consequently, by Baire’s Theorem (Chap. II, 
Theorem 9.2) the set E must contain a portion which either 1° reduces to a single 
point, or else 2° is contained in one of the sets P n , or finally 3° is contained in 
one of the sets Q n . The first case is obviously impossible, since the set E has no 
isolated points. Let us therefore consider case 2°, and suppose that there exists 
a positive integer n 0 and an open interval I such that Q^E»I(ZPn 0 - We may 
clearly suppose that 6(1) < l/n Q . Since by hypothesis, F'(x)^0 almost everywhere, 
the set P n<> is certainly non -dense. Let [a, 6] denote any interval contiguous 
to E-I. The function F is then non -decreasing on [o, 6] and this contradicts the 
fact that, since a and b belong to and b — o< 1 /«„, we Lave 

F (b) — F (a)< — (b— a) < 0. 
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[§ 8 ] 


There now remains only case 3°. In this case there exists an open interval 7 
such that the set Ed is non-empty and is contained in one of the sets Q„. But 
then F^(x )^ — 2 everywhere in 7, and F'(x)>0 almost everywhere, in 7. Therefore, 
by Theorem 7.3, the function F is non -decreasing in 7, and this again is impos- 
sible since the interval 7 contains points of E in its interior. 

We thus arrive at a contradiction in each of the three cases, and this 
proves our assertion. 

Let us mention a corollary of Theorem 7.9: 

If F is a continuous function having a derivative at every point , except per- 
haps at those of an enumerable set , and if there exists a finite constant M such that 
|F'(x)|<[J/ at almost all points x , then the function F is the indefinite integral of 
its derivative. 


* §8. The Perron-Stleltjes integral. Among the various gen- 
eralizations of the Stieltjes type for the Perron integral (vide for in- 
stance R. L. J effery [2; 3], J. Bidder [9] and A. J. Ward [3]), that 
due to Ward has the advantage of including the others and of defining 
the process of Stieltjes integration with respect to any finite func- 
tion whatsoever. In this § we shall give the fundamental defi- 
nitions and results of the theory of Ward. For a deeper analysis, 
in the case in which the function with respect to which we integrate 
is of generalized bounded variation in the restricted 
sense (vide below, Chap. VII) the reader should consult the memoir 
of Ward referred to. 

As in the two preceding §§ we shall consider only functions 
defined in JR V i. e. functions of a linear interval or of a real variable. 
We shall, moreover, restrict ourselves to integration of finite func- 
tions. This restriction is essential for the methods which we shall 
employ. 

Given two finite functions / and (?, an additive function of an 
interval U will be termed major function of / with respect to 0 on 
an interval 7 0 , if to each point x there corresponds a number £>0 
such that TJ (I)^f(x)G(I) for every interval I containing x and 
of length less than f. The definition of minor function with respect 
to O is symmetrical, and by following the method of §6, p. 201, with 
the help of the notions of major and minor functions with respect 
to Oj we define Perron- Stieltjes integration , or PS-integration with 
respect to any finite function O whatever. The ^-integral of a func- 
tion / with respect to a function 0 on an interval I 0 ~ [a, b] will 

b 

be denoted by (<£$) f f(x)dO(x ), or by (<f<£) f f(x)dO(x). 
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If U and V are respectively a major and a minor function 
of the same function / with respect to the same function (7, their 
difference U — V is evidently monotone non-decreasing. The criterion 
for ^-integrability of a function is entirely similar to that for 
«F-integrability given in § 6, p. 201, and it follows that every func- 
tion which is <?£-integrable on an interval Z 0 , is so equally on each 
subinterval of Z 0 . We are thus led to the notion of indefinite 
S*S- integral with respect to any finite function G. This indefinite inte- 
gral is an additive function of an interval, and is continuous at each 
point of continuity of the function (7. Finally we observe that the 
«F£-integral possesses the distributive property which we may express 
as follows: If each of the tivo finite functions f x and f 2 is <FS-integrable 
on an interval I 0 with respect to each of the two functions G x and G 2 , 
then each linear combination of the functions f x and f 2 is SPS-integrable 
with respect to each linear combination of the functions G x and G 2 , 
and we have 

(ps) j K/.+v,) w,+w= 2 a ‘ b ^ S) f f ‘ d0 > 

h i, A— 1,2 7 o 

for all numbers a }J a 2 , b x and b 2 . 

If G(x) = x for every point x (or, what amounts practically 
to the same, if <?(Z)=|Z| for each interval Z) ^-integration with 
respect to G coincides with ^-integration. In fact, if / is any finite 
function, each major [minor] function of / with respect to the 
function G(x)—x in the sense of Ward, is at the same time a major 
[minor] function of / in the sense of the definition of § 3; the con- 
verse is not true in general, but we see at once that if U is a major 
function of / in the sense of § 3, the function U(x) + ex is for each 
e>0 a major function of / with respect to G(x) = x. Thus the 
Perron-Stieltjes integral includes the ordinary Perron integral, at 
any rate as regards integration of finite functions. On the other 
hand, the Perron-Stieltjes integral includes also the Lebesgue- 
Stieltjes integral. We have in fact 

(8.1) Theorem . A finite function f integrable in the Lebesgue-Stieltjes 
sense on an interval Z 0 =[a 0 , 6 0 ] with respect to a function of bounded 
variation G , is so also in the Perron-Stieltjes sense and we have 

(8.2) (PS) f /dG=//dG-{/(a 0 )[^a 0 )-G(a 0 -)] + /(6 0 )[G(6 0+ )-G(6 0 )]}. 
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Proof. Let us denote for brevity, by A the right-hand side 
of the relation (8.2). We may evidently assume that the function / 
is non-negative and it is enough to consider only the following 
two cases: 

1° G is a continuous non-decreasing function. The 
proof is then just as in Theorem 3.2. Let e be any positive number. 
Since the function / is finite, there exists by the theorem of Vitali- 
Carath6odory (Chap. Ill, § 7) a lower semi-continuous function g, 
integrable (G) in the Lebesgue-Stieltjes sense, such that g(x)'>f(x) al 
each point x and such that f [ g(x)—f(x)]dG(x ) <£. Denoting by C 
h 

the indefinite integral (G) of the function g, and taking account 
of the lower semi-continuity of g, we see easily that U is a major 
function of / with respect to G on I 0 . Moreover, the function G being 
continuous by hypothesis, the number A is equal to the integral 

r“ 

fdG and we find Q*^.U(I 0 )~A<e. By symmetry we determine 

a n 

also a minor function V of / with respect to G on I 0 in such a manner 
that 0^4 — V (Z 0 )< £, and this establishes ^S-integrability off 
on Z 0 and at the same time the validity of the formula (8.2). 

2° G is a non-decreasing saltus-function. Let us denote 
by {x n }„ 1.2,... the sequence of the points of discontinuity of G 
which are in the interior of the interval Z 0 ; and let e be any positive 
number and {&„}„= 1,2.... a sequence of positive numbers such that 

(8.3) yk„-[G(x„+) — G(x n —)]<e and limft n = + oo. 

n 

n 

Let us define a function h in fi„ by writing: h(x) — f(x) for all 
the points x of /„ which are distinct from the points x„-, h(x„)=f(x„)+k„ 
for n= 1, 2, ...; and h(x)=f(a 0 ) for x<a 0 , and h(x)=f(b 0 ) for x>b 0 . 
Finally let us write, for each interval I = [a,b ], 

& 

U (I)=fh(x)dG(x)-{h(a)[G(a)-G(a-)] + h(b)[G(b+)— G(b)])- 

a 

The function of an interval U thus defined is evidently ad- 
ditive, and as we easily verify, is a major function of / with respect 
to G on J 0 . Moreover, it follows at once from (8.3) that 0^U(I o ) — 
Similarly we determine a minor function 7 of / with respect to G 
so as to have 0^.4 — V (Z 0 ) ^ £; hence A = (j?S) f / dG, and this 
completes the proof. 1 - 
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Formula (8.2) brings out the fact that the definite Perron -Stieltjes and 
Lebesgue-Stieitjes integrals are not always equal, even for a function / integrable 
in both senses. This is due to the fact that the indefinite integral of Lebesgue- 
Stieitjes is not in general an additive function of an interval. We could, of course, 
modify the definition of this integral so as to ensure its additivity as a function 
of an interval. The term in brackets { } would then disappear from the 

formula (8.2), but it would then be necessary to give up the additivity of the 
indefinite Lebesgue-Stieitjes integral considered as a function of a set (cf. 
Chap. VIII, §2). 

Let us mention further the following generalization of Theo- 
rem 6.1 on derivation of the indefinite Perron integral: 

(8.4) Theorem . If P is an indefinite SPS-integral of a finite function f 
with respect to a function G , then , at almost all points x , the ratio 

(8.5) [P(I)~f(x)G(I)]f\I\ 

tends to 0 as (5(7)— ^0, where I denotes any interval containing x. 

Hence at almost all points x , P(x)=f(x)G(x) and P(x)=f(x) G(x) 
or else P(x) = f(x)G(x) and P(x) = f(x)G(x) according as f(x)^ 0 or 
f(x)*^L 0; in particular P'(x) — 0 at almost all points x where f(x) = 0. 

Proof. The proof is quite similar to that of Theorem 6.1. 
Let 7 0 be an interval, e a positive number, and U a major function 
of / with respect to G on 7 0 such that t/(7 0 ) — P(7 0 )<r 2 . We write 
H=U — P. The function H is monotone non-decreasing, and we 
have H'(x)<ie at every point xel 0 except at most those of a set E 
of measure less than e . Now, since U (I) — f(x)G(I)^ 0 for every 
point x and for every sufficiently small interval I containing x , the 
lower limit of the ratio (8.5), as (5(7)-*0, exceeds — r at each point x 
except at most at those of E. Therefore, e being any positive number, 
this limit is non-negative for almost all points x. Combining this 
with the symmetrical result for the upper limit of the same ratio, 
we complete the proof. 

Another generalization of Theorem 6.1, also due to Ward, 
uses the following definition of relative derivation, which is slightly 
different from that given in Chap. IV, § 2 (cf. A. J. Ward [3] and 
A. Roussel [1]). 

Given two finite functions of a real variable F and G, we shall say 
that a number a is the Roussel derivative of the function F with 
respect to G at a point # 0 , if when I denotes any interval containing x 0 , 
we have (i) P(7)-a-<?(7) -> 0 and (ii) |P(7)-a*<7(7)|/0(<?;7)->0, 
as c5(7)-*0 (the ratio in (ii) is to be interpreted to mean 0 whenever 
its numerator and denominator vanish together; O ((?;!) denotes, 
in accordance with Chap. Ill, p. 60, the oscillation of G on 7). 
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When the oscillation of the function 0 at x 0 is finite, the condition (ii) 
evidently implies (i); however, when o(G;x 0 ) — - foo, the condition (i) plays an 
essential part, whereas (ii) is then satisfied independently of F and of a. 

It is also to be observed that when o (G; x 0 )s -foo, and when F is a function 
which has the relative derivative J?o(x 0 ) (cf. Chap. IV, § 2, p. 109), the latter is 
also the Roussel derivative of F with respect to G. Finally, in the case of deri- 
vation with respect to monotone functions, the two methods are completely 
equivalent. In particular therefore, when G{x)=x, Roussel derivation with 
respect to G it equivalent to ordinary derivation. 

The proof of the theorem on Roussel derivability of the in- 
definite ^-integral is much the same as that of Theorems 6.1 and 8.4; 
it depends, however, on the following lemma which may be regarded 
as a generalization of a result of W. Sierpinski [4]. 

(8.6) Lemma . Let G be a finite function of a real variable , E a bounded 
set in 11^ and 3 « system of intervals such that each point of E is a 
( right - or left-hand ) end-point of an interval (3) of arbitrarily small 
length. 

Then , given any number //<|tf[i?]|, we can select from 3 o finite 
system {Ik) of non-overlapping intervals such that 

k 

Proof. Suppose, for simplicity, that the set E lies in the open 
interval (0,1). For each positive integer n, let A„ and B n denote 
respectively the sets of the points of E each of which is respectively 
a left- or right-hand end-point of an interval (3) contained in (0,1) 
and of length exceeding 1/n. We evidently have f?=lim (A n +B n ) and 

n 

there therefore exists a positive integer « 0 such that \G[A„ n + B„ J| >/z. 
Suppose, for definiteness, that |(r[A„ o ]| > I//. 

Now, it is easily seen that, if |(?[.4.„J|=+oo, there exists a point x 0 
such that |(?[A„ 0 - J]|=-(-oo for any interval J containing x 0 in its in- 
terior. Hence, from the family of intervals (3) whose left-hand end- 
points belong to A„ 0 and whose lengths exceed 1 /» 0 , we can ob- 
viously select an interval I so as to have |G t [i']| > |(?[A / , 0 -l']|> 

Suppose now that |G t [-4. /lo ]| < -)-oo. Then, by induction, we can 
extract from 3 a finite sequence of intervals {J*=(a*, 6 *)}a=i, 2 ,....p 
in such a manner that, writing for symmetry b 0 =0 and a p+) =l, 
we have: (i) 6* — o* > l/n 0 for *= 1, 2, ..., p, (ii) 6*_i < a* and 
«*)]|< (|G ( [A no ]|— ip)/» 0 for *=1,2, ..., p, and (iii) the 
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interval (b p , a p+ \) = {b p , 1) contains no points of A n „. Since, on 
account of (i), we certainly have p < n 0 , it follows from (ii) and 
(iii) that 2’|G t [J*]| > p-(\0\_A„^\— »/» 0 > e. that 

k 

the system of intervals {!*} fulfils the required conditions. 

(8.7) Theorem . Every finite function f which is SPS-integrable with 
respect to a function 0 on an interval I 0 , is the Roussel derivative with 
respect to G of its indefinite &S-integral at each point x of I 0 except 
at most those of a set E such that \G[E]\ = 0. 

Proof. Let e be any positive number and U a major function 
of / with respect to G such that U (I 0 )~P(I 0 ) < * 2 , where P de- 
notes the indefinite ^-integral of /. Let us write H—U—P , and 
denote by E e the set of the points x of I 0 for which there exist 
intervals I of arbitrarily small lengths, such that x el and that 
H(I) It follows that each point of E e is an end-point of 

intervals /, as small as we please, which fulfil the inequality 
H(I) ^\e-\G[I]\. Therefore, denoting by fi any number le&s than 
\G[E e ]\ and applying Lemma 8.6, we can determine in I 0 a finite 
system of non-overlapping intervals < I k } such that H (I k )^±e-\G[Ik]\ 
for Jc=l 9 2, ..., p and that £\G[I k ]\ ^ I//. Consequently, since H 

k 

is non-decreasing, e 2 >IT(J 0 )^£u/4; and therefore fi< 4e, and hence 
\G[E t ]\^4e. 

Now let x be any point of I 0 . We have for every sufficiently 
small interval I containing x , 

P(I)-f(x)G(I)=U(I)-f(x)G(I)-H(I)^-H(I )>-e 2 ; 

and, unless x belongs to the set E s , we also have 

P(I)-f(x)G(I)^-H(I)^- f .\G[I]\^-e-0(G;I). 

Combining this with the similar upper evaluations of P(I)—f(x)G(I) 
obtained by symmetry, we see, since « is an arbitrary positive 
number, that / is the Roussel derivative of the function P with 
respect to G, at every point x of I 0 except at most those of a set E 
such that |(?[jE]| = 0. 
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Functions of generalized bounded variation. 

§ 1. Introduction. The definition adopted in Chap. I (§ 10) 
as starting point of our exposition of the Lebesgue integral, con- 
nects the latter with the conception of definite integral due to Leibniz, 
Cauchy and Riemann (cf. Chap. I, § 1 and Chap. VI, § 1). On ac- 
count of the results of § 7, Chap. IV, we may, however, also regard 
the Lebesgue integral as a special modification of that of Newton 
(ef. Chap. VI, § 1) and define it as follows: 

( L) A function of a real variable f is integrable if there exists 
a function F such that (i) F‘ (x)=f(x) at almost all points x and 
(ii) F is absolutely continuous. 

The function F ( then uniquely determined apart from an ad- 
ditive constant) is the indefinite integral of the function f. 

A definition of integral is usually called descriptive when it is based 
on differential properties of the indefinite integral and therefore connected with 
the Newtonian notion of primitive; this is the case of the definition ( L ) of the 
Lebesgue integral. In the note of F. Riesz [9] the reader will find an elementary 
and elegant account of the fundamental properties of the Lebesgue integral based 
on a descriptive definition differing slightly from the one given above (an ac- 
count based directly’ on the definition (L) is given in the first edition of this book). 

By contrast to the descriptive definitions, we call constructive the 
definitions of integral which are based on the conception of definite integral 
of Leibniz-Cauchy, i. e. on approximation by the usual finite sums. Thus for 
instance, the classical definition given by H. Lebesgue [1] in his Thesis may 
be regarded as constructive (the reader will find a very suggestive explanation 
of this definition in the note by H. Lebesgue [8]); cf. also the definitions of 
Lebesgue integral given in the following memoirs: W. H. Young [3], T. H. Hil- 
debrandt [1], F. Riesz [1] and A. Denjoy [7; 8]. 

As is readily seen, the definition (L) constitutes a modification 
of that of the integral of Newton, in two directions: firstly, a gen- 
eralization which enables us to disregard sets of measure zero 
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in the fundamental relation F'(x)=f(x)-, and secondly, an essential 
restriction, which excludes all but the absolutely continuous 
functions from the domain of continuous primitive functions con- 
sidered. Some such restriction is, in fact, indispensable, unless we 
give up the principle of unicity for the integral: to see this it is enough 
to consider, for instance, singular functions which are continuous 
and not constant, and whose derivatives vanish almost everywhere 
(cf. Chap. Ill, § 13, p. 101). 

But although the condition (ii) cannot be wholly removed 
from the definition (L), it is possible to replace it by much weaker 
conditions, and the corresponding generalizations of the notion of 
absolute continuity give rise to extensions of the Lebesgue integral, 
known as the integrals s', and 9 of Denjoy. 

We shall treat in this Chapter two generalizations of absolutely 
continuous functions: the functions which are generalized abso- 
lutely continuous in the restricted sense or ACG,, and 
those which are generalized absolutely continuous in the 
wide sense or ACG. If, in the definition ( L ), we replace the con- 
dition (ii) by the conditions that the function F is ACG, or ACG 
respectively, we obtain the descriptive definitions of the integrals S', 
and 9. It must be added however that the second of these defini- 
tions requires a simultaneous generalization of the notion of deriv- 
ative, to which is assigned the name of approximate derivative 
(or asymptotic derivative) and which corresponds to approximate 
continuity ( vide Chap. IV, § 10). A function which is ACG (unlike 
those which are absolutely continuous or which are ACG,) may in 
fact fail, at each point of a set of positive measure, to be derivable 
in the ordinary sense, and yet be almost everywhere derivable in 
the approximate sense. Therefore, in order to obtain the definition 
of the integral 9 from the definition (L), it is necessary not only to 
modify the condition (ii) as explained above, but also to replace 
in the condition (i) the ordinary by the approximate derivative. 

The integrals S>, and S' will be studied in the next chapter; the preliminary 
discussion of their definitions just given, is intended to emphasize the important 
part played by the generalizations of the notion of absolute continuity, which are 
treated in this chapter. The results of which an account is given in the following §§ 
are essentially due to Denjoy, Lusin and Khintchine. The first definition of 
the integral S', was given in notes dating from 1912 by A. Denjoy [2; 3] who 
employed the constructive method based on a transfinite process (vide Chap. VIII, 
§ 5). These notes at once attracted the attention of N. Lusin [2] who originated 
the descriptive theory of this integral. Finally, A. Khintchine [1; 2] and 
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A. Den joy [4] defined, independently and almost at the same time, the process 
of integration ^ as a generalization of the integral A systematic account 
of these researches may be found in the memoir of A. Den joy [6]. 

As shown by W. H. Young [6] the generalization of the Denjoy inte- 
grals can be carried still further if we give up, partially at least, the continuity 
of the indefinite integral. For subsequent researches in this direction, vide J. < \ Bur- 
kill [5; 6; 7J, J. Kidder [6;7], M. D. Kennedy and S. Pollard [*1], S. Ver- 
blunsky[l], and J. Marcin ki e wicz and A. Zygmund [1], 

Except in a few general definitions in § 3, we shall consider 
in this chapter only functions of a real variable. As therefore we 
shall be employing in Ii 1 notions established in the preceding chapters 
for arbitrary spaces R m , it will be convenient to add a few com- 
plementary definitions. 

We shall say that a point a is a right-hand point of accumulation 
for a linear set E , if each interval [a, a + h], where h> 0, contains 
an infinity of points of E. A point of E which is not a right-hand 
point of accumulation for the set E is termed isolated on the right 
of this set. The definitions of left-hand points of accumulation and 
of points isolated on the left are obtained by symmetry. 

Similarly, for each linear set E, in addition to the densities 
defined in § 10, Chap. IV, we define at each point x four unilateral 
densities: two outer right-hand , upper and lower , and two outer left- 
hand, upper and lower , densities of E . We shall understand by these 
four numbers the values of four corresponding Dini derivates of 
the measure-function (cf. Chap. IV, § 6) of E at the point x. If at 
a point x , two of these densities on the same side (right or left) are 
equal to 1, the point x is termed unilateral (right- or left-hand) point 
of outer density for the set E . The term “outer’’ is omitted from 
these expressions if the set E is measurable. 

Finally, we shall extend the notation of linear interval and 
denote, for each point a of R x , by (— oo, a), (— oo, a], (a, +oo) 
and [a, + oo) the half-lines x<a, x^a, x >a and x^a respectively. 

*§ 2. A theorem of Lusin. While discussing the significance 
of the condition (ii) in the definition (L) of an integral, we remarked 
that a continuous function which is almost everywhere derivable 
is by no means determined (apart from the additive constant) when 
we are given its derivative almost everywhere. It is, however, of 
greater interest that, for a function /, the property of being almost 
everywhere the derivative of a continuous function, itself represents 
no restriction at all, except, of course, in so far as it implies that 
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the function / is measurable and almost everywhere finite (this last 
assertion follows, for instance, from the corollaries to Theorem 10.1, 
p. 236). We shall prove this result, which is due to N. Lusin [I; 4] 
(cf. also E. W. Hobson [II, p. 284]), by means of two lemmas. 

(2.1) Lemma. If g is a function summdble on an interval [a, ft], there 
exists , for each c>0, a continuous function 0 such that (i) G'(x)—g(x) 
almost everywhere on [a, ft], (ii) (? (a) =(?(&) =0, and (iii) |<7(®)|^e 
at every point x of [a, ft]. 

Proof. Let H(x) be the indefinite integral of g(x). We insert 
in [a, ft] a finite sequence of points a=ao<ai<...<a„— ft such 
that the oscillation of H is less than e on each of the intervals 
[«/, a l+ 1 ] where i=0, 1, ..., n— 1. Let F (cf. (13.4), Chap. Ill, p. 101) 
be a function which is continuous and singular on [a, ft], monotone 
on each interval («/, a i+i ] and coincides with the function H at the 
end-points of these intervals. Writing G—H—F, we shall have 
(i) G'(x)=H'(x)—F'(x)=H'(x)=g(x) at almost all the points x of 
[o, ft], (ii) G(a)= G(b)= 0, and finally (iii) \G(x)\=\H(x)— P(®)| ^ e 
on each interval [o/, 0 /+i], and therefore on the whole interval [a, 6]. 

(2.2) Lemma. If g is a function which is summable on an interval 
J—[a , ft] and if P is a closed set in J, there exists for each e> 0 a con- 
tinuous function G such that (i) G'(x) = g(x) at almost all the points x 
of J — P, (ii) G(x)= 0 and G'(x)=0 at all the points x of P and 
(iii) |G(*-(-ft)| ^ e-|ft| for every x’of P and every h. 

Proof. Let us represent the open set J° —P as the sum of a se- 
quence (/*=(<**, ft*)}*=i, 2 ,... of non-overlapping open intervals, and in- 
sert in each interval 7* an increasing sequence of points {af} l= , 0 , 
infinite in both directions and tending to a* or ft* according as i-*— oo 
or i -> + oo. Let us further denote, for each k= 1, 2, ..., and 
i=0, ±1, ±2,..., by the smaller of the numbers f-(a^— a A )/(fe+|t|) 
and e-(ft A — a<[ +,) )/(S;-i- |i|). Lemma 2.1 enables us to determine in 
each open interval 7* a continuous function (7* such that Gk(x)=g(x) 
almost everywhere on I k , G(a ( t °)=0 for t=0, ±1, ±2, ..., and 
w ^ en a * +1) - M we now write G(x)=Gm(x) for 

xelk and fc=l, 2 ,..., and G(x)—0 elsewhere on R l} we see at once 
that the function G is continuous and fulfils the required conditions 
(i), (ii) and (iii). 
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(2.3) Lusin 9 s Theorem . If f is a function which is measurable and 
almost everywhere finite on an interval J = [a, b], there always exists 
a continuous function F such that F'(x) = f(x) almost everywhere on J. 

Proof. We shall define by induction a sequence of continuous 
functions {G„} n =< u,..., each of these functions being almost everywhere 
derivable, and a sequence of closed sets {P/,}n-o,i, .. in J , such that, 

n n 

writing Q n =Y> Pk and F n =2^Gk, the following conditions will be 

*=0 k -o 

satisfied for n— 1 , 2 ,.... 

(a) F' n (x)=f(x ) for xeQ n , 

(b) G„(x)= 0 for xeQ n _ v 

(c) !©„(* + fc)|<|fc|/2" for every xeQ n , and every h, 

(d) \J-Q„\<l/n. 

For this purpose, we choose G 0 (x)~ 0 identically and P o =0, 
and we suppose that for n=0, 1 , ...,r the closed sets P„ and the 
continuous functions G„, almost everywhere derivable, have been 
defined so as to satisfy the conditions (a), (b), (c) and (d) for each 
n C- r. Since the function / is measurable and almost everywhere 
finite, and since the function F r is almost everywhere derivable, 
we can determine a measurable subset E r of J —Q r such that 

(2.4) \J — Q r — E r \ < l/(r +1), 

and such that the derivative F' r (x) exists at each point x of the 
set E r and is bounded, together with the function f(x ), on this set. 
Hence by Lemma 2.2, we can determine a continuous function G>+i, 
almost everywhere derivable, in such a manner that (i) G' r + \{x)= 
=f(x) — F’r(x) at almost all points of E r C J — Q r , (ii)G r +i(#)=(?r+i(#)=0 
at all points of Q n and (iii) \G r +i{x -^h)] ^ \h\l2 r+l for every xeQ r 
and every h. 

Now it follows from the first of these conditions and from (2.4), 
that there exists a closed set P r + \(Z.E r such that: 

(2.5) \J-Q-P r+l \<l/(r+l), (2.6) G' r ^{x)=f(x)-F' r {x) for xeP r+h 

and we easily verify, on account of (2.6), (ii), (iii) and (2.5), that the 
conditions (a), (b), (c) and (d), still remain valid for n=r- 1-1. 

Let us now write: 

(2.7) F (x) — lim Fk(x ) =£&k(x), (2.8) Q = ]imQh—HP/,. 

k k h k 
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In view of the condition (c), the series occurring in (2.7) con- 
verges uni£ormly r and the function F is therefore continuous. Let x 0 
be any point of Q. Then for every sufficiently large integer n we 
have x 0 *Q n , and since 

F(x 0 + h)—F(x 0 ) F n (x 0 + h)—F„(x 0 ) \'G k (x 0 + h)—G k (c r 0 ) 

h h h 7 

k~tl + 1 

we find, on account of the conditions (a), (b) and (c), that 


Jim sup 

A-M> 


FjXp+h) —F(Xq) 


h 


-/(* o) 


;i/2", 


and so, that F'(x 0 ) — f(x 0 ). Now it follows from the condition (d) 
that |J — Q| = 0; we therefore have F'(x) = f(x) at almost all the 
points x of J, and this completes the proof. 

Theorem 2.3 remains valid for any space R m : 

If f is a measurable function which is almost everywhere finite in a space R m , 
there exists an additive continuous function of an interval F such that F'(x) — f (x) 
almost everywhere in Rm- 

The proof is almost the same as that ol Theorem 2.3. We may also, in the 
loregoing statement, replace the ordinary derivative F'{x) by the strong de- 
rivative (vide Chap. IV, § 2, p. 106), but the proof is then more elaborate. 

It may be remarked further that Lusin's theorem in the form (2.3), is 
obvious if the function / is summable; for / is then almost everywhere the derivative 
of its indefinite integral. But this is no longer so when we wish to determine 
a function F with a strong derivative almost everywhere equal to / (cf. Chap. IV, 
p. 132). Nevertheless, it can be shown that given in a space R m any summable 
function of a point /, there always exists an additive continuous function of an inter- 
val , of bounded variation , F, such that F s (x) = f(x) almost everywhere in R m » 

Lusin’s method is applicable in several other arguments. It has been used, 
for instance, by J. Marcinkiewicz [1], to derive the theorem: 

There exists a continuous function of a real variable F which has the following 
property: with each measurable function /, almost everywhere finite , there can be 
associated a sequence of positive numbers {Tin} tending to 0 such that 

lim [P(a5-f hn)— F(x)]/kn=f(x) 
n 

at almost all the points x. 


§ 3. Approximate limits and derivatives. Given any 
function F defined in the neighbourhood of a point x 0 of a space R m , 
we shall call approximate upper limit of F at x 0 the lower bound 
of all the numbers y (+00 included) for which the set E[jF(o?)>y] 

X 

has x 0 as a point of dispersion (cf. Chap. IV, § 10). Similarly, the 
approximate lower limit of the function F at the point x 0 is the 
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upper bound of the numbers y for which the set E[_F(x) <y] has x 0 

X 

as a point of dispersion. These two approximate limits of F at x 0 are 
called also extreme approximate limits and denoted by lim sup ap F{x) 

x-+x 0 

and lim inf ap F(x) respectively. When they are equal, their com- 

X-+X 0 

mon value is termed approximate limit of F at x 0 and denoted by 

lim ap F(x). 

■r-Kto 

It is easily seen that if E is a measurable set for which x 0 is 
a point of density, then, in the preceding definitions of extreme 
approximate limits, the sets E[E(x) > y\ and E[E(x) <y] may 

X X 

be replaced by the sets E[E(x) >y; xeE] and E [F{x)<y, xeE] 

X X 

respectively. Hence 

(3.1) Theorem. If two function s coincide on a measurable set E , 
their approximate extreme limits coincide at almost all points of E, 
and in fact at every point of density of E . 

We see further that if x 0 is a point of density for a measurable 
set E and if the limit of F(x) exists as x tends to x Q on E f then this 
limit is at the same time the approximate limit of F at the point x 0 . 
Therefore, if a function F is approximately continuous (ef. Chap. W, 
p. 131) at a point x^ we must have F(x 0 ) = \im ap F(x). 

X-^Xq 

If x 0 is a point of density for a measurable set E and if, further, 
the function F is measurable on E , it is easily seen that the ap- 
proximate upper limit of F at x 0 is the lower bound of the num- 
bers y for which the set E[F(x)^y, x e E] has x % as a point of 

X 

density. It follows, by the definition of approximate lower limit, 
that with the same hypotheses on the set E and on the function F } in 
order that f = lim ap F(x) y it is necessary and sufficient that for each 

c>0 the set ’E[l—e^.F(x)^ ; l+e-,xeE] should have the point x 0 as 

X 

a point of density. 

Let us remark finally that the following inequalities hold be- 
tween approximate and ordinary extreme limits: 

(3.2) lim inf F (x) <; lim inf ap F(x)<, lim sup ap F (x) < lim sup F (x); 

J»-x 0 *-*X, X->Xo 

and hence the approximate limit exists and is equal to the ordinary 
limit, wherever the latter exists. 
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In order to understand better the meaning of the definitions of ap- 
proximate limits, it may be remarked that the definitions of the ordinary 
limits are expressible in a very similar form. Thus the upper limit of F(x) at x 0 
may be defined as the lower bound of all the numbers y for which x Q is not a point 
of accumulation for the set E [F(x)>y]. The inequality (3.2) then becomes obvious. 

X 

For functions of a real variable, in addition to the approximate 
limits defined above, and which in this case we call bilateral , we 
introduce also four unilateral approximate limits. The approximate 
upper right-hand limit of a function F at a point x 0 is the lower bound 
of the numbers y for which the set R[F{x)>y, a?># 0 ] has x 0 as 

X 

a point of dispersion. This limit is written lim sup ap F(x). The 

X->x 0 -f 

three other approximate extreme unilateral limits are defined 
and denoted similarly. 

These generalizations of the notion of limit lead very naturally 
to parallel generalizations of derivates. Thus, given a finite func- 
tion of a real variable F, we define at each point x 0 the approximate 
right-hand upper derivate Ff v (x 0 ) and lower derivate F dp (x 0 ), the 
approximate left-hand upper derivate F ap (x 0 ) and lower derivate 
F dp (x 0 ) y and the approximate bilateral upper derivate F ap (x 0 ) and 
lower derivate F\ v [x 0 ), as the corresponding approximate extreme 
limits of the rat ; o [F(x)~ F(x 0 )]j(x— x Q ) as x-+x 0 . When ail these 
derivates are equal (or, what comes to the same, when F ap (# 0 )==F ap (# 0 )), 
their common value is called approximate derivative of F at x 0 
and is denoted by F ap (# 0 ); if further, this derivative is finite, the 
function F is said to be approximately derivable at x Q . 

For some further generalizations, such as “preponderant derivates” 
(“nombres d4riv£s pr^ponderants”), and for a deeper study of the prop- 
erties of approximate derivates, the reader should consult A. Den joy [6] 
and A. Khintchine [5]. 

The properties of bilateral approximate limits, discussed 
above, can be taken over, with the obvious formal modifications, 
so as to apply to unilateral approximate limits. In particular, 
Theorem 3.1 may be completed as follows: 

(3.3) Theorem . If two functions of a real variable coincide on a mea- 
surable set E, their approximate extreme limits and their approximate 
derivates coincide respectively at almost all points of E , and in fact 
at every point of density of E. 

Also , if a function F is measurable on a set E , we have F f d p(x)^=Fs(x) 
at almost all the points x of E at which the function F has a derivative 
with respect to the set E. 
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# 4. Functions VB and VBG. We shall denote by V(F-, E), 
and call weak variation of a finite function F(x) on a set E, the upper 
bound of the numbers 2\F(bi)— F(a,)| where is any se- 

I 

quence of non-overlapping intervals whose end-points belong to E. 
If V(F; E) <+oo, the function F is said to be of bounded variation 
in the wide sense on the set E , or, simply, of bounded variation on E , 
or VB on E . 

In the special case in whicli the set E is a closed interval, we clearly have 

V (F; E) — W (F; E), i. e. the weak variation of the function F on E then coincides 
with its absolute variation in the sense of Chap. Ill, § 13. 

The definition of functions of bounded variation in the wide sense on a set 
thus constitutes a generalization (for functions of a real variable) of that of func- 
tions of bounded variation on an interval. If E is a linear figure formed of 
disconnected intervals we only get the inequality Y (F; E)^~W (F; E), but 
it is easy to see that even then the relation W (F; E) <-foo always implies 

V (F; E) < -f~oo. 

Plainly, every function which is VB on a set E is bounded 
on E and is VB on each subset of E. Again, any function F which is 
continuous on a set E and VB on a set AQE everywhere dense in E 
(cf. Chap. II, § 2) is VB on the whole set E (for then V(F; E)=^ V(F; A)). 
Finally, if F and G are two functions which are bounded on a set E 
and M denotes the upper bound 6f the absolute values of these 
functions on E, we have Y(aF + bG; 2?)^|a|* V(F; E) + \b\-Y(G; E) for 
each pair of constants a and b, and Y{F-G\ E)^ : M-[Y(F lt ,E)-\-Y(G-,E)] 
(cf. Chap. Ill, p. 97). Hence every linear combination, with constant 
coefficients, of two functions which are VB on a set, and the pro- 
duct of the two functions , are themselves VB on this set . 

A function F(x) is said to be of generalized bounded variation 
in the wide sense on a set E, or simply, of generalized bounded variation 
on E, or again, for short, VBG on E, if E is the sum of a finite 
or enumerable sequence of sets on each of which F(x) is VB. 
From what has just been proved for functions which are VB we 
see at once that every linear combination of two functions which are 
VBG on a set, and the product of the two functions, are themselves VBG 
on this set. 

(4.1) Lemma . In order that a function F be bounded and non- 
decreasing [of bounded variation ] on a set E, it is necessary and 
sufficient that F coincide on E with a function which is bounded 
and non-decreasing [of bounded variation] on the whole straight 
line 
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Proof. Let us denote for each x, by E ixt the set of the points 
of E which belong to the interval (— oo, *]. We shall consider two 
cases separately. 

1° The function F is bounded and non-decreasing 
on E. For each x, let O(x) denote the upper bound of the function 
F on the set E (x) , or else the lower bound of the function F on E, 
according as or E( X )= 0. The function 0 thus defined is evid- 

ently bounded and non-decreasing on the whole straight line 
and coincides with the function F on E. 

2° The function F is VB on E. For each point x, let 
V(x)~ V(F; E( X )) if and V{x)=0 if E( X ) = 0. We see at once 

that the function V (x) is monotone and bounded on the whole 
straight line R x and that V (x) — F(x) is non-decreasing and bounded 
on E. Hence, by what has just been proved in 1°, there exists a func- 
tion O(x) which is bounded and non-decreasing on if x and which 
coincides on E with V (x) —F(x). We have therefore F(x)=V (x) — 0(x) 
for every xeE, and since the function V (x) — O(x), as difference 
of two bounded monotone functions, is clearly of bounded variation 
on M 1 , this completes the proof. 

(4.2) Theorem. Let F be a function which is measurable on a set E 
and which is VB on a set E X C.E, Then (i) F is approximately deriv- 
able at almost all points of E x and (ii) there exists a measurable 
set E 2 such that E X C_E 2 (Z E and that F is VB on E v 

Proof. By Lemma 4.1, there exists a function O which coin- 
cides with F on the set E x and which is of bounded variation on 
the whole straight line Let E 2 be the set of the points x of E 
at which F(x) = 0(x). Then since F is, by hypothesis, measurable 
on E, the set E 2 must be measurable. Moreover, as E x (ZE t (^E, 
the function F is, with 0, of bounded variation on E 2 , and by 
Lebesgue’s Theorem 5.4, Chap. IV, and Theorem 3.3, the finite 
approximate derivative Fi t (x)=Q'(x) exists at almost all the points 
x of E 2 . 

Theorem 4.2 leads at once to the following theorem, which 
for the Denjoy integral takes the place of Lebesgue’s Theorem on 
derivability of functions of bounded variation: 

(4.3) Theorem of Denjoy-KhintcMne. A function which is 
measurable and VBG on a set is approximately derivable at almost 
all points of this set. 
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Finally, if we make use of Theorem 9.1, Chap. IV, and 
Lemma 4.1, we may complete Theorem 4.2 as follows: 

(4.4) Theorem. A function F which is VB on a set E, is derivable 
with respect to the set E at almost all points of E. Moreover, if N de- 
notes the set of the points at which the derivative F'e(x) (finite or in- 
finite ) does nqt exist, then the graph of the function F on N is of length 
zero and consequently the set of the values taken by F on N is of meas- 
ure zero-, .in symbols ^ r )'=|F[V]| = 0. 

For an extension of Theorem 4.3 to functions of two variables, vide 
V. G. Celidze [1]. 

§ 5. Functions AC and ACG. A finite function F will be 
termed absolutely continuous in the wide sense on a set E, or absolutely 
continuous on E, or simply AC on E, if given any e>0 there exists 
an t]>0 such that for every sequence of non-overlapping intervals 
{[a*, ft*]} whose end-points belong to E, the inequality >,(ft* — a*) < tj 
implies 2W*)-F(a*)|< e . * 

ft 

A function F will be termed generalized absolutely continuous 
function in the wide sense on a set E , or generalized absolutely con- 
tinuous function on E , or finally ACG on E y if F is continuous on E 
and if E is the sum of a finite or enumerable sequence of sets E„ 
on each of which F is AC. 

These definitions generalize that of functions absolutely con- 
tinuous on a linear interval (cf. Chap. Ill, §§ 12, 13) and allow us 
to generalize certain fundamental properties of the latter. We see 
at once, by the arguments of the preceding §, that every linear combina- 
tion of two functions which are AC [ACG] on a bounded set , and the 
product of such functions , are themselves AC [ACG] on this set. Further, 
every function which is AC on a bounded set E is VB on E . In fact, if F 
is such a function, there exists an rj 0 >Q such that V(F; EI)^1 for 
each interval I of length < ^ 0 . It follows that F is bounded on E . 
Let M be the upper bound of the absolute values of F on E , and 
let J be an interval containing E\ then, J is the sum of a finite num- 
ber of non-overlapping intervals J XJ J 2J ••• , J P each of which is of 
length < t] 0j and we find V(F; E)^£V(F; 2£- J*) + 2pilf<+ oo. 

k 

It follows at once that any function which is ACG on a set E 
(bounded or unbounded) is VBG on E , and therefore, by the theorem 
of Denjoy-Khintchine given in the preceding §, every function 
which is ACG on a measurable set is approximately derivable at 
almost ail points of this set. 
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Nevertheless we can construct an example of a function which is ACG 
on an interval and which is not derivable in the ordinary sense at the points 
of a set of positive measure. 

For * this purpose, let H denote a bounded, perfect, non-dense set of 
positive measure, with the bounds a and b. Let I—[a,b] and let {I n = [a n , b n ]} 
be the sequence of the intervals contiguous to E. We denote further by (> n the 
length of the largest subinterval of [a, 6] which does not overlap the first, n in- 
tervals I g , / 2 , ... , I n of this sequence. Plainly 

(5.1) lim |I n | — 0 and lim 0. 

n n 

Now let c n denote for each n=l, 2, ... , the centre of the interval I n , and 
let F be the function defined on the interval I by the following conditions: 
1° F(x)—0 for xe H; 2° jF(c /1 ) = |I /1 |-f (> n for n— 1,2, ...; 3° the function F is linear 
in each of the intervals [«„, c n ] and [c n , b n ] where n— 1, 2, ... . Thus defined, the 
function F is continuous on I by (5.1) and is AC on H and on each I n ; since 
I— it follows that F is ACG on I. 

n 

We shall show that F is not derivable at any point xe H. In fact, since F 
vanishes on H, we have 

(5.2) F(x)^0^F(x) for every xeH. 

If therefore a point x 0 is a left-hand end-point of an l n> there can be no derivative 
F'(x 0 ) since it is clear that F{x 0 ) = F~*~{x 0 ) > 0 and therefore, by (5.2), that 
F(x 0 )4 F(x 0 ). Similarly, F(x 0 ) < 0<F(x o ) if x 0 is a right-hand end-point of an 
interval I 0 . 

If, on the other hand, x 0 e H , x 0 ^a n and x 0 ±b n for n=l,2, ..., denote by i n 
the suffix of that interval of the system I 1% J 2 , ...» I n which is nearest to x„. Then 
lim i n — +oo and 0 <" \c in — x 0 \ < \Ii n \+Q n > and so, by the definition of F(x)> we have 

F(Ci n )—F(x Q )-=\It n \+e in ^-\Ii n \+e n > l c /„— *ol- Since lim c /n =x 0 , it follows that either 

_ n 

F(x 0 )^l or F(x 0 ) — l f which by (5.2), proves that F is not dervable at x 0 . 

L*et us remark, in conclusion, that a function F which i* con- 
tinuous on a set E and which is AC on a subset of E everywhere dense 
in E , is AC on the whole set E. 

§ 6. Lusln’s condition (N). A finite function F is said to 
fulfil the condition (N) on a set 1 E, if |F 1 [£T]| = 0 for every set ECE 
of measure zero (for the notation cf. Chap. Ill, p. 100). Clearly, 
a function which fulfils the condition (N) on each of the sets of a finite 
or enumerable sequence , also fulfils this condition on the sum of these sets . 

The condition (N) was introduced by N. Lusin [I, p. 109], who 
was the first to recognize the importance of this condition iu the theory of the 
integral. It is easy to see that in the domain of continuous functions the con- 
dition (N) is necessary and sufficient in order that the function should transform 
every measurable set into a measurable set (cf. H. Hademacher [1] and H. Hahn 
[I, p. 586]). Among the more recent researches devoted to the condition (N) and to 
other similar conditions (cf., below, Chap. IX) the reader should consult above 
all N. Bary [3]. 
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(6.1) Theorem. A function which is ACG on a set necessarily fulfils 
the condition (N) on this set. 

Proof. Since each set on which a function is ACG is the sum 
of a sequence of sets on which the function is AC, it will suffice 
to prove that |-F[IT]| — 0 whenever H is a set of measure zero and F 
a function AC on H. 

For this purpose, let e be any positive number. We denote, 
for brevity, by M(E) and m(E) respectively the upper and lower 
bounds of F on E, when E is any subset of H, and we write 
M (E)= m (E) — 0 in the case in which E—0. Since the function F is AC 
on if, there exists a number ?7>0suchthat (/?•/*)-— m(H-I k )]<ie 

k 

for every sequence of non-overlapping intervals {/*} which sat- 
isfies the condition < g. Now since the set H is of measure 

A 

zero, we can determine a sequence of non-overlapping intervals 
{I*} which satisfies this last condition and which covers, at the same 
time, the whole set E. Therefore, since \F[H-I k ]\^M(HI k )--m(H-I k ) 
for each k, it follows that |F[N]|^e. Hence, e being arbitrary, 
\F[H]\=Q. 

It follows from Theorem 7.8 (1°), Chap. IV, that every func- 
tion which is absolutely continuous on an interval and whose deriv- 
ative is almost everywhere non-negative, is monotone non-decreasing. 
With the help of Theorem 6.1, this result can be extended to functions 
which are ACG and we have: 

(6.2) Theorem. Every function F(x) which is ACG on an interval I 
and for which we have almost everywhere in this interval F'* v (.r)^0, 
or more generally, F + (x) ^ 0, is monotone non-decreasing. 

In particular therefore, if the approximate derivative of a func- 
tion which is ACG on an interval vanishes almost everywhere on this 
interval, then the function is a constant. 

Proof. Let e be any positive number and let G(x)=F(x) + ex. 
The function G is then ACG on the interval I (together with the 
function F), and moreover, we have G + (x)=F + (x)-\-e^ e > 0 at 
almost all the points x of I. Hence, denoting by H the set of the 
points* at which G + (*)^.'0, we have |/f| = 0, .and this implies, by 
Theorem 6.1, that |G[H)| = 0. Thus the set G[H] cannot contain 
any non-degenerate interval, and by Theorem 7.1, Chap. VI, the 
function G(x)=F(x)-{-ex is non-decreasing on I. It follows at once, 
by making e->-0, that the function F is itself non-decreasing. 
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If we analyze the preceding proof, we notice that the hypothesis of 
generalized absolute continuity of F(x) has been used only to show that 
every function of the form F(x)+ex, where e>0, fulfils the condition (N). It is 
remarkable that the condition (N) need not remain satisfied when we add a 
linear function to a function fulfilling the condition, even when this last 
function is restricted to be continuous ( vide S. Mazurkiewicz [1]). For 
this reason it is not enough to suppose in the preceding proof that the function 
F(x) merely fulfils the condition (N). 

Nevertheless, Theorem 6.2 itself does remain true for arbitrary functions 
which fulfil the condition (N). The theorems which will be proved in Chap. IX, §7, 
include a more general result, namely that every continuous function which fulfils 
the condition (N) and whose derivative is non-negative at almost all the points at 
which it exists , is monotone non -decreasing . 

We shall show (vide, below, Theorem 6.8) that for continuous 
functions of generalized bounded variation on closed sets, the con- 
verse of Theorem 6.1 is true, i. e. that in this case the condition 
(N) is equivalent to generalized absolute continuity. Similarly, for 
continuous functions of bounded variation the condition (N) is 
equivalent to absolute continuity in the ordinary sense. 

We shall begin with a lemma which will also prove useful 
elsewhere. 

(6.3) Lemma. If, for a finite function F , the inequalities F+(x)^M 
and F~(x)^ — M, where M is a finite non -negative number, hold at 
each point x of a set D, then |.F[D]|<I 

Proof. Let e be any positive number. Let D n denote for each 
positive integer n the set of the points x of D for which we have 
F(t)—F(x)^(M + e)-\t — x\ whenever |f— x\^.lfn. The sets D n evid- 
ently constitute an ascending sequence and we see easily that 
D — lim D n . 

n . . 

With each Z)„ we can associate a sequence of intervals {/*}*-_ 1,2,... 
which covers D n and fulfils the condition 

(6.4) Z\i[ n) \^\D„\+e, 

k 

and in which, further, no Z* n) has length greater than 1 In. By defini- 
tion of D„, we therefore have, for every pair x u x 2 of points of 
the inequality +e)-\x 2 -x 1 \^(M + 

so that |Z , [D„-7*' , ]| < (Jlf-f £)-|Z*" , |. In view of the inequality (6.1) 
it therefore follows that, for every n, 

\F[D „]\ < < ( M + e)^\I^\ ^ ( M + £ M|7>„| + r); 

ft ft 

and, by making first n->oo and then e-^0, wc derive |Z’( 7>]| *CM \l) . 
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(6.5) Theorem . If a function F is derivable at every point of a meas- 
urable set Dj then 

(6.6) \F[D}^f\F'(x),dx. 

b 

Proof. We may clearly assume that the set D is bonnded. 
Given any £>0, let D„ denote, for each positive integer n, the set 
of the points xeD, at which (n—l)E<^\F'(x) <«•£. We then have, 
by the preceding lemma, 

oo oo 

\F[D]\<2 |J , [D.]h <Sne- \D„\ < / \F'(x)\ dx + £• |D|, 

/i--l n-1 

and hence, e being arbitrary, the inequality (6.6). 

The formula (6.6) remains true when we replace in it the derivative F'(x) 
by any Dini derivate, provided however that we restrict the latter to be finite 
in D. The proof then becomes rather more elaborate and requires certain general 
theorems on derivates which will be established later ( vide Chap. IX, § 4). 

(6.7) Theorem. In order that a function F(x) which its continuous 
and VB on a bounded closed set E, be AC on E, it is necessary mid 
sufficient that F(x) fulfil the condition (N) on this set. 

Proof. In view of Theorem 6.1, it remains to be shown that 
the condition is sufficient. 

Suppose then that F fulfils the condition (N) on E. Let a 0 
and b 0 be the bounds of E, and let 0 denote the function which 
coincides with F at the points of E and is linear in the intervals con- 
tiguous to E. The function 0 is evidently continuous and of bound- 
ed variation, and fulfils the condition (N) on the whole interval 

K, M- 

Given any subinterval 1=00] of OoOol, ^ us denote by D 
the set of the points of I, at which the function G is derivable, and 
write H—I—D. Plainly |/Z| = 0, and therefore also |(?[Z7]|--=0. 

On the other hand, since the interval with the end-points G(a) 
and G(b) is contained in G[I], we have by Theorem 6.5 

b 

I G(b) -G(a) | < \G[D]\ + \G[H ]| = |<?[D]| < f\G\x)\ dx. 

a 

Since this inequality is valid for every subinterval I— [a, ftj 
of [a,,, ft 0 ] an d since by Theorem 7.4, Chap. IV, the derivative G’(x) 
is summable on [a 0 , 6 0 ], it follows that the function G is AC on [a 0 , 6 0 ], 
and therefore that F is AC on the set E, where F and G coincide. 
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It is easy to see that the same argument leads to a more general theorem: 
in order that a continuous function F which is continuous on an interval /„ be ab- 
solutely continuous on this interval, it is necessary and sufficient that F fulfil the 
condition (N ) on 7, and that its derivative exist almost everywhere on J 0 and be sum- 
mable on Z 0 . This theorem will again be generalized in Chap. IX, §7. 

(6.8) Theorem. In order that a junction F which is continuous 
and VBG on a closed set E be ACG on E, it is necessary and sufficient 
that F fulfil the condition (N) on this set. 

Proof. In view of Theorem 6.1, we need only prove the con- 
dition (N) sufficient. Now, since F is VBG on the set E, this set 
is expressible as the sum of a sequence of bounded sets { E „ } such 
that the function F is VB on each E n . By continuity of F on the 
closed set E, we may suppose (cf. § 4, p. 221) that each set E„ is 
closed. Since further F fulfils the condition (N) on E, it follows 
from Theorem 6.7 that the function F is AC on each E„, and there- 
fore ACG on E. 

§7. Functions VB* and VBG*. We shall denote by Y,(F;E) 
and term strong variation of a finite function F on a set E, the 
upper bound of the sums £0 (F-,Im) where {/*} is any sequence of 

A 

non-overlapping intervals whose end-points belong to E (in accord- 
ance with Chap. Ill, p. 60, O (F;I*) denotes the oscillation of F 
on the interval I*). If V,(F, !?)< + oo, the function F will be said 
to be of bounded variation in the restricted sense on the set E, or 
VB, on E. 

Following the order of the definitions of § 4, we shall say 
further that a finite function is of generalized bounded variation in 
the restricted sense, or simply, is VBG, on a set E, if E is the sum 
of a finite or enumerable sequence of sets on each of which the 
function is VB,. 

In the special case in which the set E is a closed interval, we 
clearly have Y t (F-,E) — Y( L F-,E)=W(F',E). It is easy to see that 
we always have V(F;F)^V,(F;F); so that every function which 
is VB, on a set, is VB on this set, and consequently, every function 
which is VBG, on a set, is VBG on this set. We next observe (by 
using trivial inequalities for the VB, case, and thence passing on 
to the VBG, case) that every linear combination, with constant coef- 
ficients, of two functions which are VB, [VBG,], and also the product 
of two such functions, are themselves VB, [VBG,]. 
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Let us observe that, for a function, the property of being VB, VBG, AC, 
or ACG, on a set E depends solely on the behaviour of the function on E; whereas 
the property of being VB* or VBG* on E depends on the behaviour of the func- 
tion on the whole of an interval containing the set E. In other words, of two 
functions which coincide on a set E, one may be VB* or VBG* on E and the 
other not. The same remark applies to the property of being AC* or ACG* with 
which we shall be concerned in the next §. 

We have remarked in § 4, p. 221, that a function which is 
continuous on a set E and which is VB on an everywhere dense 
subset of E, is necessarily VB on E. A similar result is true for func- 
tions which are VB*, the assumption of continuity of the given 
function being now superfluous. We have in fact: 

(7.1) Theorem. Every finite function F which is VB* on a bounded 
set E is equally so on the closure E of this set. 

Proof. Let a and b denote the bounds of E and therefore 
also of E. Let a—a 0 <a l <...<a„—b be any finite sequence of 
points of E\ we write l = [a,b] and I*=[a*-i, a*] for Zc=l, 2, ... , n. 
We shall say that an interval I* is of the first class if it contains 
points of E, and otherwise of the second class. The intervals I x 
and I„ are clearly of the first class, and we see easily that, if an 
interval I* is of the second class, then both the adjacent intervals 
I k - 1 and Ik+i are certainly of the first class. 

Let us denote by 1 = i 0 <H l <....<.i r —n the suffixes of the 
intervals I* of the first class and by j 0 <h<—<js those of the 
second. With each interval L of the first class we associate a point 
bheli • E and we write Jh—[bh-\,b h ] for ft=l,2, ... ,r. It is easy 

"h 

to see that 

ZO(F-, I i )^0(F;I i ) + 0(F ] IJ + 2-Z0(F- 1 J h ) 

h -0 h h 1 

and 

SO(F-,L)^ZO(F-,J h ). 

/i 0 h *-=1 

Hence, SO (F-, /*) ^ 3-Z0(F-,J„)+2-0(F-, Z)<3-[V,(f ; E)+0(F; I)), 

b=\ _ h 1 

and therefore V,(f; E)^3-[V,(F-,E) + 0(F-,I)]<+3o. This completes 
the proof. 
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(7.2) Theorem. If a function F is \ r BG_ on a set E, then F is 
derivable at almost all points of this set ; and further if N denotes the 
set of the points x of E at which the function has no derivative, finite 
or infinite, then |J T [2iT]| =yt(B(_F;.Af)} = 0. 

Proof. We may clearly suppose that the set E is bounded 
and that the function F is VB* on E. Moreover, by Theorem 7.1, 
we may suppose that the set E is closed. 

Let therefore a and b denote the bounds of E on the left and 
on the right, and {I„} n -\x. . the sequence of the intervals contiguous 
to E. Writing m„ and M„ respectively for the lower and upper 
bounds of F on I„, we define two functions m(x) and M (x) on 
[a, b] making m(x) = m„ and M{x)—M„ for xel° n where n=l, 2 , ... , 
and m(x)= M(x) — F(x) for xeE. The two functions m(x) and M(x) 
thus defined are plainly of bounded variation on the whole interval 
[a, b] and coincide with F(x) on the set E. Therefore, denoting 
t>y N 0 the set of the points xe E at which either one at least 
of the (finite or infinite) derivatives M'(x) and m'(x) does not 
exist, or both exist without being equal, we find by Theorem 9.1, 
Chap. IV, that 

(7.3) \F[N o ]\ = \A{B(F-,N o ))\ = 0 . 

On the other hand, m(x)=F(x)— M(x) at every point x of E, while 
m(x)^F(x)^M (x) on the whole interval [a, ft]. It follows that 
the derivative F'(x)~ m'(x) = M'(x) exists at each point x of E, 
except at most those of the set N 0 which is subject to the rela- 
tion (7.3). Finally, since the functions m(x) and M (x) are deriv- 
able almost everywhere on the interval [a, ft], the function F 
must be derivable at almost all points of E, and this completes 
the proof. 

Theorem 7.2 (for continuous functions and in a slightly less complete form) 
was first proved by Denjoy and by Lusin, independently. It plays in the theory 
of the Denjoy -Perron integral (vide, below, Chap. VIII) apart similar to that of 
Lebesgue’s Theorem (Chap. IV, § 5) in the theory of the Lebesgue integral. A cor- 
responding part is played in the theory of the Denjoy-Khintchine integral by 
Theorem 4.3. But the latter is stated in terms of approximate derivation (cf. 
the example of p. 224) whereas Theorem 7.2, which requires no modification 
of the notion of derivative, is, for functions of a real variable, a direct generaliza- 
tion of Lebesgue’s Theorem. 
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§ 8. Functions AC* and ACG*. A finite function F is said 
to be absolutely continuous in the restricted sense on a bounded set E , 
or to be AC* on E , if F is bounded on an interval containing E and 
if to each e>0 there corresponds an rp> 0 such that, for every finite 
sequence of non-overlapping intervals {/*} whose end-points belong 
to Ej the inequality £\I k |<ij implies ^0(F y I k )<e. 

k k 

A function will be termed generalized absolutely continuous on 
a set E y or ACG* on E , if the function is continuous on E and if 
the set E is expressible as the sum of a sequence of bounded sets 
on each of which the function is AC*. 

In the case in which the set E is an interval, the class of func- 
tions AC* on E coincides with that of the functions which are ab- 
solutely continuous on E in the ordinary sense. Every function 
which is AC* on an arbitrary set E is AC on E , and every function 
which is ACG* on E is ACG on E. On the other hand, any function 
which is AC* on a bounded set is VB* on this set , and therefore, any 
function which is ACG* on a set is VBG* on this set . To see this, 
let F be AC* on a bounded set E . We can then determine a positive 
number rj 0 such that 1 for every interval I of length 

less than rj 0 . Let J be the smallest interval containing E y let M be 
the upper bound of \F(x)\ on J, and suppose J expressed as the 
sum of a finite number of non-overlapping intervals J ly J 21 ... , J p 
each of length less than t ] 0 . We shall then have 

V,(F; E) < E-J k ) + 2 Mp < (2M+l)p < + oo, 

and this shows that the function F is VB* on E. 

Thus a function which is AC, on a bounded set E is both AC 
and VB, on this set, and similarly a function which is ACG, on E 
is both ACG and VBG, on E. The converse also is true, provided 
that the set E is restricted to be closed (vide, below, Theorem 8.8). 
Instead of giving a special proof of this result, we shall establish 
some more general theorems about the relations between the notions 

VB, AC, VB*, AC,, VBG, ACG, VBG, and ACG,. 

(8.1) Lemma. Let E denote a bounded closed set, {J*} the sequence 
of the intervals contiguous to E, and I 0 the smallest interval containing E. 
Then, for any function F which is finite on I 0 , we have 

(8.2) O(F-,I 0 )^V(F-,E)+2-2O(F-,J„). 

k 
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Proof. Let M,m and M 0 , m 0 be the bounds (upper, lower) 
of F, on E and on I 0 respectively. Let M'o be any finite number 
less than M 0 , and x 0 a point of I 0 such that M' 0 ^F(x 0 ). If we have 
x 0 eE, this inequality implies M'o^M, while if x 0 belongs to an 
interval, J ko say, of the sequence {J k !, M'o^ M+0(F-,J k J. Hence 

(8.3) M 0 ^M+£O(F-,J k ), 

k 

and similarly 

(8.4) m 0 >m-ZO{F-,J k ). 

k 

On subtracting (8.4) from (8.3), we obtain, since M — m^V(F; E), 
the relation (8.2). 

(8.5) Theorem • In order that a function F which is VB [AC] 
on a bounded closed set E , be VB* [AC*] on E , it is necessary and suf- 
ficient that the series of its oscillations on the intervals contiguous 
to E be convergent . 

Proof. The necessity of these conditions is obvious (cf. 
above p. 231); we have therefore only to prove them sufficient. 

Let then {J k ) denote the sequence of the intervals contiguous 
to E } and suppose that 

(8.6) ZO (F-,J k )< + ™. 

k 

We shall consider the two cases separately: 

1° The function F is VB on E , i. e. V(F-,E)< + oo. Then 
by Lemma 8.1, we have for every sequence {/„} of non-overlapping 
intervals whose end-points belong to E , 

20(F-,I n )^ ZV(F‘, E-I n ) + 2 SO(F-,J k )^ Y(F; E) + 2 -ZO(F-, J k ). 

n n k h 

It follows by (8.6) that V*(F-,E)< + oo, i. e. that the function F 
is VB. on E. 

2° The function F is AC on E. Then, given any e>0, 
there exists a number t] > 0 such that, for every sequence of non- 
overlapping intervals {I„} whose end-points belong to E, the in- 
equality 2\In\<y implies £V(F-, E-I„) <e/2. Now by (8.6), there 

n n 

exists a positive integer fc 0 such that 

co 

(8.7) 

*=*o+l 

Denote by r/ 0 the smallest of the fc 0 4-l numbers rj, IJJ, | J s |, ..., |J*J, 
and let {!„} be any sequence of non-overlapping intervals with end- 
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points in E, the sum of whose lengths is less than rj 0 . None 
of these intervals /„ can contain one of the first k 0 intervals of the 
sequence {J„}, and it follows from (8.7) and from Lemma 8.1, that 
£0(F;I„)<^£V(F-,E-I„)-\-eI2^£. Therefore the function F is AC, 

n n 

on E, and this completes the proof. 

(8.8) Theorem. In order that a function F be AC* [ACG*] on a bound- 
ed closed set E, it is necessary and sufficient that F be both VB, and 
AC [VBG, and ACG] on E. 

Proof. The necessity of these conditions is obvious, so that 
we have only to prove them sufficient. 

Now, if the function F is both YB, and AC on E, it follows 
at once from Theorem 8.5 that F is AC* on E. If on the other hand, 
F is YBG, and ACG on E, we can express the set E as the sum of 
a sequence of sets {E„\ on each of which F is both VB, and AC. 
Since F is ACG, and so continuous, on the set E, which is by hypo- 
thesis closed, F is AC on the closure E n of each E„. Similarly, by 
Theorem 7.1, F is VB* on each E„. Therefore by what has just been 
proved, F is AC, on each of the sets E„ and so, ACG, on the set E. 

Theorem 8.8 ceases to hold if we remove the restriction that the set E is 
closed. Let E be the set of irrational points, and 2 the sequence of ra- 

tional points, of the interval [0, 1]; and let F(x)= 0 for xeE> and F(a n )~ l/2 n 
for n=l,2, ... . The function F thus defined is evidently VB* and AC on E. To 
show that F is not AC*, nor even ACG*, on E> suppose that the set E is the sum 
of a sequence of sets {E n ) on each of which F is AC*. By Baire’s Theorem (Chap. II, 
Theorem 9.2), one at least of the sets E n would be everywhere dense in a (non- 
degenerate) subinterval of [0, 1]. But this is plainly impossible, since every sub- 
interval of [0, 1] contains, in its interior, points of discontinuity of the function F . 


§ 9. Definitions of Denjoy-Lusin. The definitions which 
we have adopted in this chapter for the classes of functions VBG, 
ACG, VBG, and ACG, are based on the ideas of A. Khintchine [3]. 
Rather different definitions were given by N. Lusin [I] and 
A. Denjoy [6], which are equivalent to those of Khintchine when 
we restrict ourselves to continuous functions. We give them here, 
in the form of necessary and sufficient conditions, in the following 
theorem. 

(9.1) Theorem. In order that a function which is continuous on 
a closed set E, be VBG [VBG,, ACG, ACG,] on E, it is necessary 
and sufficient that every closed subset of E contain a portion on which 
the function is VB [VB,, AC, AO,]. 



234 CHAPTER VII. Functions of generalised bounded variation. 

Proof. We shall deal only with the VBG ease, the proof for 
the other three cases being quite similar. 

1° The condition is necessary. Let F be a function which 
is continuous and VBG on E . We can then express the set E as 
the sum of a sequence of sets { E n } on each of which the function F 
is VB and, by continuity of F, the sets E n may be supposed closed. 
Then by Baire’s Theorem (Chap. II, § 9), every closed subset of E 
has a portion P contained wholly in one of the sets E n . The func- 
tion F , which is VB on each E n , is thus certainly VB on P. 

2° The condition is sufficient. Suppose that F is a continuous 
function on E and that every closed subset of E contains a portion 
on which F is VB. Let {!„} be the sequence of all the open intervals 
I with rational end-points such that F is VBG on E l. Let Q=2JE-I n 

n 

and H=E—Q . Plainly F is VBG on Q and we need only prove 
that the set H is empty. 

Suppose therefore that H 4- 0. Since H is clearly a closed set, 
there exists, by hypothesis, an open interval J such that HJ 4=0 
and that the function F is VB on H J. We may evidently assume 
that the end-points of J are rational. Therefore, the function P, 
which is VBG on the set Q , is also VBG on the set EJ(ZHJ +Q. 
This requires J to belong to the sequence of intervals { I n } and we 
have a contradiction, since the set J?, by definition, has no points 
in common with any of the intervals 

Theorem 9.1 shows in particular that every continuous function which 
is VBG on an interval I is at the same time VB on some subinterval of Z. It fol- 
lows that for every continuous function which is VBG on an interval Z, there 
exists an everywhere dense system of subintervals on each of which the function 
is almost everywhere derivable, although this function may, as shown in § 5, 
have no derivative at the points of a set of positive measure. 

§ 10. Criteria for the classes of functions VBG,, ACG„ 
VBG and ACG. A series of theorems enabling us to distinguish 
certain types of functions of generalized bounded variation and cer- 
tain types of generalized absolutely continuous functions, are due 
to A. Denjoy [6]. 

(10.1) Theorem . If F(x) is a function which fulfils at all points 
of a set , except at most those of an enumerable subset , one of the 
inequalities 

(10.2) F(x)<Z + oo or 

then the function F(x) is VBG* on this set. 
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Proof. It is enough to show that the set E of the points at 
which we have, say, F(.r) < + oo, is the sum of an enumerable in- 
finity of sets on each of which F is VB*. 

For any positive integer n, let E„ denote the set of the points x 
of E such that for every t, 

(10.3) 0<|<— a?|<l/» implies [F(t) —F{x)]f(t—x)^n. 

Further, for each integer i, let E' n denote the part of E„ sit- 
uated in the interval [i/n, (i+l)/n], and a 1 ,,, b‘„ the lower and 
upper bounds of those of the E'„ which are not empty. We have 

oo oo -f 00 

clearly E=SE„=S SK. 

n~ 1 n— 1 / 00 

Let now F„(x)=F(x)—nx. For every point xeE„ and for 
every point t which fulfils the first of the inequalities (10.3), we 
then have [F„(t) — F n (x)]/(t — x) ^ 0. In particular, given any pair 
of points x x ,x 2 (where x 1 ^.x 2 ) of E'„, we obtain 

(10.4) F n (a\,)7?F n (x x ) ^ F„(x 2 ) >F„(b i n ), 

and for every t such that x x ^t^x 2 we find that F^x^^Fnit^FniXj). 
This last relation implies that, for every interval I— [o, j8] whose 
end-points belong to the set E'„, we have O (F n -,I) — F„(a)—F„(^), 
and therefore by (10.4), for every sequence !//— [<*/,/?,] } of such 
intervals (which do not overlap), 

SO (F„; Ij)=S[Fn(aj)-F„(pj)] ^F„(a! n )-F n (b' n ). 

j J 

The function F n (x), and therefore also the function F(x)=F„(x) + nx, 
is thus VB* on every set E‘„ and this completes the proof. 

(10.5) Theorem. If F(x) is a function which fulfils at all points 
of a set E , except , perhaps, at those of an enumerable set, one at least 
of the conditions 

(10.6) — oo<F + (x)^F+(x)<+oo or —oo<F~(x)^F~(x)<+oo, 

then the set E is the sum of an at most enumerable infinity of sets 
on each of which the function F is AC*. 

If, therefore , we are given further that F(x) is continuous on E, 
then F(x) is ACG* on E. 
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Proof. It is enough to show that if at every point x of a set A 
the two extreme right-hand derivates F^(x) and F + (x) are finite, 
then A is expressible as the sum of an at most enumerable infinity 
of sets on each of which the function F is AC,. 

Let A„ denote, for each positive integer n, the set of the points 
xeA such that, for every t, 

(10.7) 0^.t—x^,lln implies \F(t)—F(x)\^n-(t—x): 

and, for each integer i, let A' n denote the common part of A„ and 

oo oo-l-oo 

of the interval [i/n, (t+l)/»]. Plainly A=EA n =£ HA!,. 

n— 1 n— 1 i - - oo 

Now, if I=[x 1 ,x 2 ] is any interval whose end-points belong to 
An, we have, for every tel, the inequality arj^l/n, and so, 

on account of (10.7), iJF’(f)— F(x 1 )\^,n-(t— Jj)^«-j/|. This gives us 
0(P; I) <; 2n-|Z|; and therefore for any finite sequence {I/, of such 

intervals, £0(F-,Ij)^2n-2J\Ij\. It follows that the function F 

j J 

is AC, on each of the sets A 1 ,,, and this completes the proof. 

Theorem 10.5 shows, in particular, that every function which is continuous 
and everywhere derivable (even only unilaterally) is ACG*. Nevertheless as we 
saw in Chap. VI, p. 187, such a function need not be absolutely continuous. 

In view of Theorem 7.2, we may state also the following corollary of Theo- 
rems 10.1 and 10.5: A function F which fulfils at each point of a set E one at least 
of the inequalities (10.2) or (10.6), is derivable at almost all points of E. In part- 
icular therefore, the set of the points at which a function has (on one side at least ) 
its derivative infinite , is of measure zero. These statements will be generalized 
in Chap. IX, §4. 

Theorems 10.1 and 10.5 contain sufficient conditions in order that a func- 
tion be VBG* or ACG*, but these conditions are clearly not necessary. Never- 
theless, by employing the notion of derivates relative to a function (cf. Chap. IV, 
p. 108), it is easy to establish conditions similar to those of the preceding theorems, 
the conditions being this time both sufficient and necessary. Thus, as shown 
by A. J. Ward [3]: 

In order that a finite function F be VBG* on a set E , it is necessary and suf- 
ficient that there exist a bounded increasing function U such that the extreme deriv- 
ates of F with respect to U are finite at each point of E except , perhaps , those of 
an enumerable set. 

1° In order to establish the necessity cf the condition, let us suppose first 
that the function F is VB* on E. In view of Theorem 7.1 we may assume that 
the set E is bounded and closed. Let [a, 5] be the smallest interval containing E , 
and, for each point x of the interval [a, 6], let V^x) and V t (x) denote the strong 
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variations of F (cf. §7) on the parts of E contained in the intervals [a y x] and 
[x,b] respectively. Finally for each x of [a, b], let V (x)~\\(x) — V % (x) + x. The 
function V thus defined is increasing and finite on [a, 6], and can therefore be 
continued as a bounded increasing function on the whole straight line li x . We 
see at once that throughout the set E y except at most at the points a and 6, the 
derivates of the function F with respect to V are finite and indeed cannot ex- 
ceed in absolute value the number 1. 

Suppose now given any function F which is VBG* on E. The set E is then 
expressible as the sum of a sequence {E n ) of sets on each of which the function 
F is VB*. Consequently, by what has just been proved, there exists for each n 
a bounded increasing function V n with respect to which the function F possesses 
finite derivates at each point of the set E n except at most at the bounds of 
this set. Therefore, denoting by M n the upper bound of \V n(x)\ and writing 
U (x)-^Yn(x) /2 n M n , we see at once that the function U thus defined is in- 

n 

creasing and bounded and that at each point of E y except perhaps those of an 
enumerable set, the function F possesses finite derivates with respect to U . 

2° The condition is sufficient. Let F be a finite function having at each 
point of E y except perhaps at those of an enumerable subset, finite derivates 
with respect to a bounded increasing function U. For each positive integer n, 
let E n denote the set of the points x of E for which the inequality t—x\^l/n 
implies \F(t) — F(x)\^n*\U(t) — U(x)\; and let each E n be expressed as the sum 
of a sequence {E l n )t= ,2. .. of sets of diameter less than 1/n. We see easily (as in 
the proof of Theorem 10.1) that the function F is VB* on each set E f n , and 
since the sets E I, plainly cover all but an enumerable subset of E y it follows at 
once that the function F is VBG* on E. This completes the proof. 

If we analyze the first part of the above argument, we see that if the func- 
tion F is VBG* on E and moreover bounded on an interval containing the set E 
in its interior, there exists an increasing bounded function U with respect to 
which the function F has its derivates finite at each point of E. Moreover, if 
the function F is continuous on an interval containing E in its interior, the func- 
tion U may be defined in such a way as to be itself continuous (cf. the proof 
of Lemma 3.4, Chap. VIII). Finally, it can be shown that in order that a junc- 
tion F be ACG* on an open interval I, it is necessary and sufficient that there exist 
an increasing and absolutely continuous function with respect to which the function 
F has its derivates finite at every point of I. 

(10.8) Theorem. If at every point x of a set E , except perhaps at 
the points of an enumerable subset , a function F fulfils any one of 
the inequalities 

(10.9) P + (a;)<+ 30 > -^~(®)<+°°> F~(x)>-°°, 

(10.10) l.|,(*)<4' 00 ! ^»p(*)>— °°> 

then F is VBG on E. 
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Proof. We need only consider the case of the first of the 
inequalities (10.9) and that of the first of the inequalities 

(10.10). It is therefore sufficient to show that each of the sets 
^4 = E[P + (a;)<-t-.oo] and B=E[F av (x)<. +oo] is expressible as 

X X 

the sum of an enumerable infinity of sets on each of which F is 
of bounded variation. 

Consider first the set A. Given any positive integer n, let A„ 
denote the set of all the points xeA such that, for every t, 


(10.11) 0<^t— x^l/n implies F(t)—F(x)^.n-(t — x), 


and by A‘„, for each integer i, the part of A„ contained in the 
interval [i/n, (i+l)/n]. Let F„(x)=F(x) — nx. 

For every pair x lt x 2 of points of A'„, where x 1 ^x 2 , we have 
0^x 2 — x 1 ^l/n, and so, by (10.11), F(x 2 ) — F(x 1 )^n-(x 2 — x x ), i. e. 
F n (x 2 )— F„(x l )^.0. The function F n (x) is thus monotone non-in- 
creasing on each set A‘„, and it follows that A‘„ is expressible as the 
sum of a sequence of sets {A‘n)j=\x... on each which F n (x) is mono- 
tone and bounded. The function F(x) = F„(x) \-nx is then plainly of 


bounded variation on each of the sets and moreover we have 

co oo -{-oo co 

a=Za,.=Z Z Za” 

n— 1 n~~ 1 i - - oo jr— 1 

Consider now the set B. From the definitions of approximate 
upper limit and approximate upper derivate (cf. § 3, p. 220), it 
follows at once that to each point xeB we can make correspond 

F(£)-F(x). 


->» 


has the 


a positive integer n such that the set E ^ ^ 

point a; as a point of dispersion. Therefore, denoting by B„ tbe set 
of the points xeB such that the inequality 0 h 'A 1 \n implies both 
the inequalities 


(10.12) |E[F(£)— F(x)^n-($— x)-, x^£^x+h]\\^h/3 
and 

(10.13) \E>[F{x)-F£)^n-(x—*y, x-h^^x]\^h/3, 

I 

OO { 

we have B = Z Sn - We denote further, for every integer i, by B„ 

n—\ 

the part of B„ contained in the interval [i/n. (t-f-l)/n] and we write 
as before, F„(x)—F(x)—nx. 
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The main part of the proof consists in showing that, for every i, 
the function F„(x) is monotone on B'„. 

For this purpose, let x v x 2 be any pair of points of a B'„, and 
let x x <x 2 . We plainly have 0 <x 2 — x x ^ 1/n, so that by writing 
x=x x and h—x 2 — x x in (10.12), we obtain 

|E[F (£)— F («i) > n- (£— x x ); x x < f<a 2 ]| < {x 2 —x x )IS. 

Similarly, from (10.13) with x=x 2 and h=x 2 —x l , we derive 
\E[F(x 2 )-F(£)^n-(x 2 -fr, x x < £ < x 2 ]\ < (x 2 -x x )/3. 

The two inequalities thus obtained show that the interval 
[a?!, x 2 ] contains a point f 0 such that 

F (£o)— F (*i)< n-(Zo— x i) and ^W-f(§oX»>’Mo)' 

By adding these two inequalities term by term, we obtain 
F(x 2 ) — F(x x ) <in-(x 2 —x ,), and so finally F n (x 2 )~ F„(x t ) < 0. 

We have thus shown that the function F n (x) is monotone de- 
creasing on each B'„. It follows that B‘„ is expressible as the sum of a se- 
quence of sets {B'n)j^\;i,... on each of which F„(x) is monotone and 
bounded, and on which the function F(x)=F n (x)-\-nx is 'therefore 

oo oo oo oo 

of bounded variation. Moreover, we have B=£ B n =£ £ Xb'J. 

n 1 n 1 h- oo j- 1 

This completes the proof. 

On account of Theorem 4.2, it follows immediately from Theorem 10.8 
that any measurable junction which satisfies one of the inequalities (10.9) or (10.10) 
at each 'point of a set E, is approximately derivable at almost all points of E. This 
proposition will be generalized and completed in Chap. IX (§§ 9 and 10). 

(10.14) Theorem . If two extreme approximate derivates on the 
same side are finite for a function F(x) at every point of a set K, 
except at most in an enumerable subset , then the set E is the sum of 
a sequence of sets on each of which F(x) is absolutely continuous . 

Consequently , if the function F(x) is further given to be contin- 
uous on E , then F(x) is ACG on E . 

Proof. It is clearly enough to show, for instance, that the 
set A—E [~ oo < Ftp(x) ^.Ftp(x) < -foo] is the sum of an at most 

X 

enumerable infinity of sets on each of which F is AC. 
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Now we can make correspond, to each point xe A, a positive 
integer n such that x is a point of dispersion for the set 
E[\F(^)—F{x)\^n-(^—x)] (cf. § 3, p. 220). Hence, denoting by A n 

the set of the points xe A such that, for every h , the inequality 
0^h^2jn implies 

(10.15) |E[|E(£)— F(x)\^n-(g— x)-, x^£*^x + h]\^h/4, 

oo 

we have A=£A n -, and, denoting as before by A l n (for each integer i) 

n =- 1 

the part of A n contained in the interval [i/n, (t+l)/n], we obtain 

OO -f-°° / 

A=I I'A 1 ,,. 

n— 1 i — oo 

Consider now any two points x x and x 2 of A \ x , where x 1 <Cx 2J 
and let x 3 =2x 2 —x v 

We have, on the one hand, 0 <x 3 — x l = 2-(x 3 — x 2 )^2/n, so 
that by writing x=x 1 and h = x 3 ~x 1 in (10.15), we obtain the 
inequality 

x 1 ^^x 3 \\^(x 3 -x 1 )!^=(x 3 -x i )!2, 

and a fortiori 

(10.16) |E[lE(f)— F(x 1 )\^n (S~ a?!); x 2 ^$ ^x 3 ]\^ (x a —x 2 )/2. 

On the other hand, we have — # 2 =# 2 ---# 1 ^l/w, and so 

by (10.15) with x=x 2 and h=x 3 —x 2 , we find 

(10.17) |E[lE(c)— F(x z )\^n-{$— x 2 ); ar 2 < £ < a? 3 ]j (x 3 — ® 2 )/4. 

The inequalities (10.16) and (10.17) show that there exists 
a point £ 0 in [x 2 , x 3 ] such that we have at the same time 

l-f’(^o)— • ^ K)! < »*(lo— ■ ■x 1 )Kn-(x a —i x 1 )=2n-(x i ~, x t ), 
\F(? 0 )—F (ar 2 )| < n.{£ 0 —x 2 ) < n • (x 3 — x t ) = 2n- (x 2 — a^), 

and this requires | F(x 2 ) — F(x x )\ < 4w-|x 2 — icj. This last inequality 
is thus established for every pair of points x u x 2 of any one of the 
sets A‘„, and it follows at once that F is AC on each of the sets A'„. 
This completes the proof. 

Theorem 10.8 shows in particular that a continuous function which is 
everywhere approximately derivable, even unilaterally, is necessarily ACG. 

In Chap. IX, § 9, we shall give two further criteria for a function to 
he ACG* or ACG. 



CHAPTER VIII. 


Denjoy integrals. 

§ 1. Descriptive definition of the Denjoy integrals. 

We shall base the study of the Denjoy integrals on their descriptive 
definition. The essential ideas have already been sketched in 
Chap. VII, § 1. We now complete them further as follows. 

A function of a real variable / will be termed 9-integ ruble 
on an interval I=[a,b] if there exists a function F which is ACG 
on I and which has / for its approximate derivative almost every- 
where. The function F is then called indefinite 9-integral of / on I. 
Its increment F(I) = F(b) —F(a) over the interval I is termed 
definite 9-integral of f over I and is denoted by 

& 

( 9 ) J f(x) dx or (9) j f(x) d.v. 

1 a 

Similarly, a function / will be termed 9 t -integrable on an 
interval / = [«,&], if there exists a function F which is ACG, on I 
and which has f for its ordinary derivative almost everywhere. 
The function F is then called indefinite 9^- integral of / on I; the 
difference FU)—F(b)—F(a) is termed definite 9,- integral of / over I 

ft 

and denoted by (9*) j f(x) dx or by (:?,)/ f(x) d.r. 

i « 

For uniformity of notation, the Lebesgue integral will fre- 
quently be called £-integral. 

The integrals 9 and i ?, are often given the names of Denjoy 
integrals in the wide sense, and in the restricted sense, respectively. 
The first of these is also termed Denjoy-Khintchine integral (cf. 
Chap. VII, §1), and the second, Denjoy-Perron integral (for the 
latter, as we shall see below in § 3, is equivalent to the Perron in- 
tegral considered in Chap. VI). 
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It is immediate, by Theorem 6.2, Chap. VII, that when a func- 
tion is 9- or SVintegrable on an interval, its definite Denjoy 
integrals are uniquely determined on this interval (its indefinite 
integrals being determined except for an additive constant). More 
generally, if two functions are equal almost everywhere and the one 
is integrable in the Denjoy sense (wide or restricted) on an interval 7 0 , 
then so is the other and the two functions have the same definite in- 
tegral over I 0 . Another immediate consequence of the preceding 
definitions is the distributive property for Denjoy integrals. Thus, 
if two functions g and h are 9- or 9 ^-integrable on an interval 7, the 
same is true of any linear combination ag-\-bh of these functions , and 
we have 

(9) / [ a'g(x) + b-h(x )] dx—a • (£>) J g(x)dx + b-(9) f h(x) dx. 
i ii 

It follows from Theorem 10.14, Chap. VII, that a continuous 
function which is approximately derivable at all points except , perhaps , 
at those of an enumerable set , is necessarily an indefinite 9-integral 
of its approximate derivative . Similarly, by Theorem 10.5. Chap. VII, 
a continuous function which is derivable (in the ordinary sense) at 
all but an enumerable set of points , is an indefinite 9 ^-integral of 
its derivative . The process of integration 9+ therefore includes that 
of Newton (cf. Chap. VI, § 1). The fundamental relations between 
the Denjoy and Lebesgue processes are given in the following 

(1.1) Theorem. 1° A function f which is 9^-integrable on cm 
interval I is necessarily also 9 -integrable on I and we have 

(2>) f f dx = (£>,) / / dx. 
i i 

2° A function f which is £- integrable on an interval I is 

necessarily 9 ^-integrable on I and we have ( 9 J f fdx == f f dx. 

i i 

3° A function which is 9-integrable and almost everywhere non- 
negative on an interval I is necessarily £-integrable on 7. 

Proof. 1° and 2° follow at once from the; definitions of the 
Denjoy integrals and from the descriptive; definition e>f the Lebes- 
gue integral (Chap. VII, § 1). As regards 3°, it is sufficient to recall 
the fact that, in view of Theorem 6.2, ('hap. VII, a function which 
is ACG and whose appre>ximate derivative is almost everywhere 
non-negative, is necessarily monotone non-decreasing, and therefore 
its derivative is Hummable. 
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Part 3° of Theorem 1.1 shows that for functions of constant 
sign the Denjoy processes are equivalent to that of Lebesgue (cf. 
Theorem 6.5, Chap. VI, for the corresponding result concerning the 
Perron integral). Hence, we derive the following further extension 
of Lebesgue’s theorem on term by term integration of monotone 
sequences of functions (Chap. I, Theorem 12.6). 

(1.2) Theorem. Given a non-decreasing sequence { f n } of functions 
which are 9-integrable on an interval I and whose 3-integrals over 1 
constitute a sequence bounded above, the function f(x) = lim f n (x) is 

n 

itself , necessarily , ^-integrable on I and we have 

(3) I f (x) dx — lim (9) / f n (x)dx. 

1 « / 

Exactly the same is true with 9* in place of 9 in the hypothesis 
and conclusion. 

Proof. This theorem reduces at once to the theorem of Le- 
besgue just referred to, for we need only consider in place of the 
functions f n , the functions f n — /,, which are integrable in the Denjoy 
sense and non-negative, and which are therefore integrable in the 
Lebesgue sense on account of Theorem 1.1 (3°). 

We shall show later on (Chap. IX, § 11) that the extreme 
approximate derivates of any measurable function are themselves 
measurable functions. This includes the result that any function 
which is 9-integrable is measurable. In the meantime we give an 
independent proof of this last assertion. 

(1.3) Theorem. A function which is 9-integrable is necessarily 
measurable and almost everywhere finite. 

Proof. Let / be 9-integrable on an interval I and let F be 
its indefinite integral. The function F is therefore ACG on I, so 
that I is the sum of a sequence {!?„} of closed sets on each of which 
F is AC. By Lemma 4.1, Chap. VII, there exists for each n a func- 
tion F„ of bounded variation on I, which coincides with F on E n . We 
therefore have almost everywhere on E„ the relation f(x)= F ap {x)=F' n {x)', 
and since the derivative of a function of bounded variation is meas- 
urable and almost everywhere finite, it follows that / isj meas- 
urable and almost everywhere finite on each E„ and consequently 
on the whole interval I. 



244 


CHAPTER VIII. Denjoy integrals. 


Finally, let us mention as an immediate consequence of The- 
orem 9.1, Chap. VII, 

(1.4) Theorem . If a function f is 9-integrable on an interval Z 0 , 
then every closed subset of I 0 contains a portion Q such that the function f 
is summable on Q and such that the series of the definite 9- integrals 
of f over the intervals contiguous to Q is absolutely convergent. 

Similarly , if the function f is 9^-integrable on h, then every 
closed subset of I 0 contains a portion Q such that the function f is sum- 
mable on Q and such that the series of the oscillations of the indefinite 
9+~integrals of f over the intervals contiguous to Q is convergent . 

§ 2. Integration by parts. We have already observed 
(Chap. VI, p. 210) that a slight modification of the definition of 
Lebesgue-Stieltjes integral leads to an indefinite integral which is 
an additive function of an interval. As this modification will 
be useful to us in the present §, we now formulate it explicitly. 

Given a finite function g integrable in the Lebesgue-Stieltjes 
sense with respect to a function of bounded variation F on an 
interval I=[Oyb]y we shall write 

b b 

(S) fg dF= f g dF — {g(a) ■ {F (a) — F («-)]+ g(b) ■ [F(b+)-F(b)]). 

a a b 

The number (S)fgdF will be called definite S-integral of g with 

a 

respect to F over I. As we see at once, this number (unlike the 
Lebesgue-Stieltjes integral) does not depend on the values taken 
by the function F outside the interval /, and for each point c 
of [a, 6] we have 

ebb 

(S)fgdF + (S)f g dF — (S) f gdF. 

a c a 

(2.1) Theorem . Let g be a bounded function integrable with respect 
to a monotone non-decreasing function F on an interval [a, 6]. Then : 

b 

(i) (S)fgdF — /Li'[F(b) — jF(a)], where p is a number between 

a 

the bounds of the function g on [a,fc]; 

X 

(ii) writing .S' (x) = (S) j g dF for a^x^b, we have S'(x) — 

a 

= g(x)-F'(x) at almost all points of continuity of the function g , and 
in fact at every point x where g is continuous and F derivable. 
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Proof. Clearly (i) follows at once from the obvious inequality 

b 

m-[F(b) — F(a)]^(S)JgdF^M-[F(b) — F(a)], where m and M are 

a 

the lower and upper bounds of g on [a, 6]. In order to establish (ii), 
consider a point x 0 at which g is continuous and F is derivable. 
We may suppose, by subtracting a constant from g if necessary, 
that g(x 0 )= 0. Denoting, for each interval J, by e(J) the upper 
bound of \g(x)\ on J, we have \S(J)\/\J\^e(J)-F(J) l\J\ and taking J 
to be an interval containing x 0 and of length tending to zero, we 
find S'(x o ) = 0 = g(x o ). This completes the proof. 

(2.2) Lemma. Let F be a function of bounded variation on an interval 
I 0 -- [a, b], 0 a continuous function on I 0 , and U the function defin- 
ed on I 0 by the formula 

X 

(2.3) H(x)=F(x) 0(x)—(S) j G(t) dF(t) for a^ix^b. 

a 

Then , if the function 0 is ACG [ACG*] on Z 0 , so is the function H. 

Proof. We may clearly assume F to be monotone non-de- 
creasing. Denoting by M 0 the upper bound of \F(x)\ on i 0 , we shall 
begin by proving that for every interval IC^o we have 

(2.4) \H(I)\^M 0 -\G(I)\+O(Qi I)-F(I) and 0(H; Z)<3Jf 0 -O(<7; I). 
In fact, by Theorem 2.1 (i), we have, for every subinterval J=[a,/?] of I, 

p 

H(p)-H(a) = [G(p)-G(a)]-F(p)+[F(p)-F(a)]-G(a)-(S)fG(t)dF(t) 

= [G(P)-G(a)]F(P)+[F(P)-F(a)l[G(a)-n], 

where y is a number between the bounds of G on J. Consequently, 
|fl'(J)Kilf 0 fG , (d r )| + O(G;t7) l 1 (J), and the first of the relations (2.4) 
follows by choosing J = 1. On the other hand, we derive 
\H(J)\^3M 0 -O(G; I) for every interval J CI^ and hence the second 
relation (2.4). 

Hence, since the function G is continuous, the function H is 
continuous also. Further, if {I k ) is any finite sequence of non- 
overlapping intervals and if m denotes the largest of the numbers 
0(G; I*), we obtain from the relations (2.4) 

E\H(I„)\^M 0 -Z\G{I k )\ + (o-F(I 0 ) and ZO(H-,I k )^3M 0 ‘ZO(G,I k ). 

* k k ft 
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The first of these inequalities implies that if the function G is AC 
on a set E, so is the function H, and consequently, that if the func- 
tion G is ACG on the whole interval h, then the function H is also 
ACG on 7 0 . Similarly, the second of the above inequalities shows 
that if G is ACG* on I 0 then so is F, and this completes the proof. 

We can now complete Theorem 14.8, Chap. Ill, which concerned 
integration by parts for the Lebesgue integral, by establishing 
a similar theorem for the Denjoy integrals: 

(2.5) Theorem. If F(x) is a function of bounded variation and g(x) 
afunction £>- or S^-integrable on an interval I 0 =[a,b], then the function 
F(x) g(x) is integrable on I 0 in the same sense, and moreover denoting 
by G the indefinite integral of g, we have 

h b 

(S') [F(x)g(x)dx=G(b)F(b)-G(a)F(o)-(S) f G(x)dF(x). 

a a 

Proof. We shall establish the theorem for the ^-integral. 
The proof for the £>* -integral is quite similar. 

By Lemma 2.2, the function H defined by the formula (2.3) 
is ACG on I 0 . Moreover, by Theorem 2.1 (ii), if we form the ap- 
proximate derivative of both sides of (2.3), we obtain almost every- 
where the relation H' &9 (x) — F(x)G' a . v (x)=F(x)g(x). It follows that 
the function F(x)g(x) is 9-integrable on the interval I 0 and that 

b 

(S>) f F(x) g(x) dx—H(b)~H(a). This last relation is equivalent to 

a 

the one to be proved. 

The idea of the above proof, which is directly based on the descriptive defi- 
nition of the Denjoy integrals, is due to Zygmund. For another proof, depending 
on the constructive definition of these integrals, cf. for instance E. W. Hobson 
[I, p. 711]. For an interesting generalization of the theorem on integration by 
parts to the c/^-integral (cf. Chap. VI, §8) vide A. J. Ward [3]. 

From Theorem 2.5, there follows easily the second mean 
value theorem for the Denjoy integral, which may be regarded 
as a generalization of Theorem 14.10, Chap. III. 

(2.6) Theorem . Given a non-decreasing function F on an interval 
I 0 =[a,b] and a function g which is 9-integrable on h, there must 
exist a point £ in I 0 such that 

b i b 

(9)fg(x)F(x)dx=F(a)-(S)) j g(x) dx + F(b)-(9) j g(x)dx. 
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Proof. Writing G(x) = (@) f g(t)dt, we have by Theorems 2.5 

a 

and 2.1 (i) the relation 

b b 

(0) f g(x)F(x)dx = 0(b) F(b) — (S) f G{x)dF(x) = 

a a 

^G(b)F(b)^^[F(b)-F(a)]^^F(a) + [G(b)-^F(b), 

where /i is a number between the bounds of G(x) on I. It follows 
that there exists a point £ in I 0 such that /*=(?(£), and the rela- 
tion just obtained becomes 

b 

(S>) f g(x) F(x)dx=F(a)-GU)+F(bHQ(b)-Gm, 

a 

which, by definition of G{x), reduces to the required formula. 

§3. Theorem of Hake-Alexandroff-Looman. The relations 
between the Den joy integrals and those of Lebesgue and of Newton 
having already been obtained in §1, we now proceed to establish 
an important result of Hake, Alexandroff and Looman, which asserts 
the equivalence of integration in the restricted Den joy sense with 
Perron integration. 

At the same time we shall show that in the definition of Perron 
integral (Chap. VI, § 6) we need only take account of the continuous 
major and minor functions. In order to make this assertion quite 
precise, let us agree to say that a function / is F 0 -integrable on an 
interval / 0 if 1° the function has continuous major and minor func- 
tions on I 0 and 2° denoting by U any continuous major function 
and by V any continuous minor function of / on Z 0 , the lower bound 
of the numbers U (I 0 ) is equal to the upper bound of the numbers 
F(/ 0 ). The function / is then plainly ^-integrable on Z 0 , the definite 
^-integral of / on I 0 being equal to this common bound. We have 
to prove the converse, i. e. that every function which is c^-inte- 
grable is also d^-integrable. 

(3.1) Lemma. If a function f is <F 0 -integrable on each interval interior 
to an interval [a, b] and if the definite tP-integral over the interval 
[a + e,b — rj] tends to a finite limit as e-^0+ and ^->0+, then the 
function f is F 0 -integrable on the whole interval [a, 6]. 
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Proof. It is clearly sufficient (by halving the given interval) 
to consider the case of a function / which is <f 0 -integrable on each 

X 

interval of the form [a, 5— e] where 0 <e<b—a. Let P(x)=(F) f fdx 

a 

for a^x<b and p = P{b—). 

We choose any positive number cr. Writing for symmetry o 0 =o, 
we consider any increasing sequence of points {«*}*= o.i,... w hich con- 
verges to b. The function f being ^-integrable on each interval 
[a*, a*4-i], we easily define, on the half open interval [a, b), a con- 
tinuous function F such that F is a major function of / on each 
of the intervals [a*, o* + i] and that [F(a>) —F{a k )] ~[P{x) — P(«*)]<a/2* 
for and Zc=0, 1, .... By the second of these conditions 

the oscillation of the function F on the interval [a*, 6) tends to 0 
as k-+ oo, and therefore F has a finite limit F(b-) at the point b. 
Writing F(x)—F(a) + (x — a) 1 3 for x<a, and F(x)=F(b—) for x^b, 
we extend the definition of F to make this function continuous 
on the whole straight line R v and the following conditions are then 
satisfied: 

(3.2) —oo4=F(x)^tf{x) for a^.x<b and (3.3) F(b)—F(a)^.p+2o. 

Now let c be an interior point of [a, 6] such that the oscillation 
of F on [c,6] is less than a. For each point x of [c,6], let 0(x) denote 
the oscillation of F on [x,b\ The function 0{x) is continuous and 
non-increasing on the interval [c,i], and we extend its definition 
on to the whole straight line by making 0(x)—0(c) for x<o 
and 0(x)=0(b) = 0 for x>b. We now write Q(x)=F(x)—0(x) 
and U(x)— G(x) a ■ (x — b)' 13 j (b — a)' 3 . Since the function 
o-{x—b) x *l{b—a) li —0{x) is non-decreasing, it follows at once from 
(3.2) that —004 -U(x)^f(x) for a^x<b. Moreover, since 0(b)— G(x) 
is non -negative for each point x of the interval [c,fe], and 0 for 
x^b, we find G(b)^ 0, and therefore U(b)—+oo. Hence U is a con- 
tinuous major function of / on the interval [a, 6] and fulfils, by (3.3), 
the inequality U(b)—U(a)^:F(b)—F(a)+2o^ : p+4o. Similarly we 
define a function V which is a continuous minor function of / on 
[a, 6] and fulfils the condition V(b)—V(a)^p—io. It follows that 
the function / is <F 0 -integrable on [a, 6], This completes the proof. 
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(3.4) Lemma. Let Q be a closed . emd bounded set , a, b its bounds , 
{Z*=[aA, 6 a ]} the sequence of intervals contiguous to Q , and f a function 
which is summable on Q and S^-integrable on each interval contiguous to Q. 

Then , if the series of the oscillations of the indefinite integrals 
of the function f on the intervals Ik converges , the function f is 
^Q-integrable on the whole interval [a, 6] and we have 

b b k 

(3.5) (■?) I fdx = I fdx + y(&) I fdx. 

a q * <*'* 

Proof. Let e be a positive number and let if be a positive 
integer such that 

oo 

(3.6) IO„<e, 

k~K-\ 1 

where Oh denotes the oscillation of the indefinite ^-integral of f 
on the interval I k . Denote by f x the function which agrees with / 
on the set Q and on the intervals Ik for k^K, and which is 0 
elsewhere. By Theorem 3.2, Chap. VI, and by the hypotheses 
of the lemma, the function f x has a continuous major function U 1 
and a continuous minor function V x such that 

(3.7) U^-UM-e^ ffdx+SWffdx^VM-ViW + e. 

Q k ~ 1 4 

We shall now define a continuous major function for f — f x . 
Let F k be, for each k, a continuous major function of / on 
the interval I k such that F k (a k ) = 0 and 0(24; 2*)<20*, and let A k (x) 
and B/,{x) denote, for any point xel k , the oscillations of the func- 
tion F k on the intervals [a*,#] and (4,6*] respectively. We write 
0(x)= F k (x)+A k (x)—[B k (x)—B k (a k )} when xel k and k>K, and 
0(x)= 0 elsewhere. Finally, for each x , we write 

U 2 (x)=G(r)+Z {x) G(b k -), 

k 

where the summation it. extended over the indices k for 

A 

which b k ^x. Since, for every k, we have G(a k -\-) = G(a k ) — 0 and 
0(G;I*)<3-0(F*; 2*)<6 0*, the function U 2 is continuous on the 
Btraight line R v and since the function (2 vanishes identically on 
each interval I k for k^K, we have by (3.6), 

oo 

(3.8) U 2 (b)-U 2 (a)^6-ZO k ^6e. 

k=K+\ 
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Now, for each k, we have 0[x )~^ 0 and G(b k — ) — (?(#)> 0 for 
every point £<-/*. Therefore the increment of the function V 2 is 
non-negative on each interval containing points of the set Q, and 
consequently U 2 (x)^0=f{x)—f 1 (x) at each point x of this set. 
Again, since the function G vanishes on each interval I* for k^K, 
we have U 2 (x) — 0—f(x)—f t (x), whenever xeli for k^K. Finally, 
since the function A k (x)—B k (x) is non-decreasing on each l k , we- 
see that — oo4=U 2 (x)^F k (x)^f(x)=f(x)—f 1 (x) at each point xel k 
for k>K. Thus U 2 is a continuous major function of / — on [a, 6]. 
Similarly, we determine a continuous minor function V 2 of / — f v 
subject to the condition V 2 (b)—V 2 (a)^ — 6e which corresponds to 

(3.8) . Therefore, writing (J—U 1 +U 2 and V=V 1 + V 2 , we obtain 
a continuous major function U and a continuous minor function V 
for / on [ a,b], and if we denote by p the right-hand side of (3.5), 
we obtain from (3.6) and (3.7), U(b) — U (a) —8 e<p<F(6) — F(a)+8£. 
The function / is thus ^ 0 -integrable on the interval [a, b] and its 
definite •‘/’-integral over this interval is given by the formula (3.5). 

(3.9) Theorem. A function f which is & m -integrable on an interval I 0 
is necessarily F 0 -integrable on I 0 , and we have 

(Q.)[fdx=(f)Jfdx. 

4 /„ 

Proof. Let F be an indefinite 0* -integral of / on I 0 . We call 
an interval I(ZI 0 regular, if the function / is <^ 0 -integrable on I 
and if the function F is on / an indefinite ii 1 integral of /. Further, 
we call a point xel 0 regular, if each sufficiently small interval 
IQIo containing x is regular. Let P be the set of the non-regular 
points of I 0 . We see at once that the set P is closed and that every 
subinterval of I 0 which contains no points of this set is regular. 
We have to prove that the set P is empty. 

Suppose, if possible, that P4=0. By Lemma 3.1 we see easily 
that every interval contiguous to P is regular and that the set P 
therefore has no isolated points. On the other hand, by Theorem 9.1, 
Chap. VII, the set P contains a portion P 0 on which the function F 
is AC,. Let J 0 be the smallest interval containing P 0 . Since the 
set P has no isolated points, the same is true of any portion of P, 
and therefore P-Jo+0. It follows that in order to obtain a con- 
tradiction, which will justify our assertion, we need only prove that 
the interval J 0 is regular. 
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To show this, let J be any subinterval of J 0 and let Q be the 
set consisting of the points of the set PJ and of the end-points of J. 
We denote by {/„} the sequence of the intervals contiguous to Q 
and by G the function which coincides with F on Q and is linear 
on the intervals I n . Plainly the function G is absolutely continuous 
on J. Therefore, since G'(x) — F'(x)==f(x) at almost all points x of Q , 
and since G{I n ) — F(I n ) for each n, we obtain 

(3.10) F(J) = 0 (J)= f G‘(x)dx=£F(I„) + [f(x)dx. 

J " Q 

Now the function / is summable on Q and <^ 0 -integrable on 
each interval I n and moreover, F is an indefinite ^-integral of / on 
each of these intervals. The series of the oscillations of F on the 
intervals I n being convergent, it follows, by Lemma 3.4, that the 
function / is ^-integrable on J and that, on account of (3.10), 

F(J) = (F) j f dx. Therefore, since J is any subinterval of J 0J the 
j 

interval J 0 is regular and this completes the proof. 

(3.11) Theorem . A function which is tf-integrable on an interval I 0 
is necessarily &+-integrable on I 0 . 

Proof. Let / be a function ^-integrable on an interval I 0 
and let P be its indefinite ^-integral. We shall show that the func- 
tion P is an indefinite £>• -integral of /. Since P'(x) = f(x) almost 
everywhere (cf. Theorem 6.1, Chap. VI), it is enough to show that 
the function P is AOG* on / 0 , i. e. that any closed set QC.Iq oon * 
tains a portion on which the function P is AC*. 

Let H be any major function of /. Since H(x)>— 00 at each 
point x of I 0 , the function H is by Theorem 10.1, Chap. VII, VBG* 
on I 0 , and hence I 0 is expressible as the sum of a sequence of closed 
sets (cf. Theorem 7.1, Chap. VII) on each of which the function U 
is VB*. It follows, by Baire’s Theorem (Theorem 9.2, Chap. II) that 
the set Q contains a portion Q 0 on which the function H is VB*. 
Since the difference P— H is a monotone function, the function P 
is actually VB* on Q q . We shall show that P is further AC* on Q 0 . 

For this purpose, we denote by J 0 =[a,6] the smallest interval 
containing Q 0 . Let e be any positive number and U a major func- 
tion of / on I 0 such that 

(3.12) 


U(I 0 )-P(I 0 )<e. 



252 


CHAPTER VIII. Denjoy integrals. 


Let P x and U x denote the functions which coincide on Q 0 with the 
functions P and U respectively, and which are linear on the intervals 
contiguous to $ 0 and constant on the half-lines (— oo, a] and 
The function P x is clearly of bounded variation. On the other hand, 
we see easily that U x (x )> — o° at every point, and that U 1 (x)^P\(x) 
at almost all points, of the interval J 0 . Therefore, writing f x {x) — P\(x) 
wherever the second of the above inequalities holds, and f x {x)=— oo 
elsewhere, we see at once that the function f x (x) is summable on J 0 

and has TJ X for a major function. It follows that U X (I)^ j f x (x) dx 

i 

for each interval /CVo> an( l therefore that the function of singularities 
(cf. Chap. IV, p. .120) of U 1 is monotone non-decreasing on </ 0 . Let T x 
be the function of singularities of P x . Since the function P x — U x 
is monotone non-increasing on J 0 and since, by (3.12), we have 
Q>P l (J 0 )-U l (J 0 )=P{J 0 )-U(Jo)>—e> it follows that l\(I)^-e 
for each interval IC.J 0 ; and e being any positive number, this 
requires l\(l )> 0 for every interval IC^o- Similarly, by considering 
minor functions of / in place of major functions, we find 2\(i)^0, 
and therefore, finally, 7\(/) ~ 0, for each interval ZC^o- The func- 
tion P x is thus absolutely continuous on J 0 . This requires the func- 
tion P to be AC on the set Q q as well as VB*, and therefore AC* 
on this set on account of Theorem 8.8, Chap. VII. Thus every clos- 
ed set QCIo contains a portion Q 0 on whicfy the function P is 
AC*, and this completes the proof. 

The first of the theorems proved in this §, which together establish the 
equivalence of the processes of ^ 0 - and ^-integration, was derived 

in 1921 by II. Hake [1] from the constructive definition of the integral 
(vide below, § 5). The second theorem was obtained some years later by P. Alex- 
androff [ 1 ; 2 ] and H. Looman [4] independently. For an interesting extension 
of these results to Perron-Stieltjes integral, vide A. J. Ward [3]. 

It should, perhaps, be added that in their original definitions 0. Per- 
ron [1] and 0. Bauer [1] employed only continuous major and minor functions. 
The equivalence of the original Perron -Bauer definition with that of Chap. VI, 
§ 6, has therefore been established here as a consequence of Theorems 3.9 et 3.11. 

Let us remark further that in the definition of Perron integral, ordinary 
major and minor functions may be replaced by generalized continuous major 
and minor functions defined as follows. A function U is a generalized continuous 
major function of a function / on an interval I if 1° U is continuous and VBG* 
on Z, 2° the set of the values assumed by U at the points at which U'(x ) — — oo, 
is of measure zero, and 3° U(x)^f(x) at almost all points x. The definition of 
generalized continuous minor functions is obtained by symmetry. 
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We shall conclude this § with the following result, due to 
Marcinkiewicz: 

(3.13) Theorem. A measurable function f which has on Z 0 at least 
one continuous major function and at least one continuous minor 
function , is necessarily &-integrable on Z 0 . 

Proof. Let U and V be respectively a continuous major func- 
tion and a continuous minor function of / on Z 0 . We shall call 
a point xel 0 regular if the function / is <^-integrable on each suf- 
ficiently small interval I Clio which contains x. Let Q be the set 
of the points x of I 0 which are not regular. The set Q is plainly closed 
and we see at once that the function / is <F-integrable on each sub- 
interval of J 0 which contains no points of Q . Thus it has to be proved 
that 

Suppose, if possible, that Q=^ 0. For every interval I on which 

the function / is ^-integrabie, we have U(I)^(c?) f f(x)dx^V(I). 

i 

Therefore, if [a,&] is an interval contiguous to Q , the definite 
<F-integral of / on the interval [a + e, b — r}] interior to [a, b] tends 
to a finite limit as £->0 and 0. By Lemma 3.1, the function / 
is thus c^-integrable on each interval contiguous to Q. It follows, 
in particular, that Q can have no isolated points. 

Now let Q 0 be a portion of Q on which the functions V and V 
are both VB*. Such a portion exists by Theorem 9.1, Chap. VII, 
since the functions U and V are VBG* on I 0 on account of Theorem 10.1, 
Chap. VII. Let J 0 be the smallest interval containing Q 0 . Since 
U(x)^f(x)^ V (x) everywhere on Z 0 , the function / is summable 
on Q 0 together with the two derivatives U(x) and V(x). On the 
other hand, denoting by {!„} the sequence of the intervals contiguous 
to Q 0 and by O n the oscillation on I n of the indefinite ^-integral 
of /, we shall have 0„<O( Uj Z„)-f-0(F; I n ) for every n, and so 
£O n <+ 00 • It follows by Lemma 3.4, that the function / is S'- inte- 

n 

grable on the whole interval J 0 . But this is clearly impossible, for 
since the set Q has no isolated points, the interval J 0 contains in 
its interior some points of Q . We thus arrive at a contradiction 
which completes the proof. 

Just as in the definition of the Perron integral, we may replace, in The* 
orem 3.13, ordinary major and minor functions by generalized continuous ones 
(cf. above, p. 252). Nevertheless, the conditions of Theorem 3.13 differ from those 
of the definition of Chap. VI, § 6, in that continuity is essential. In fact, if 
we write f{x) — 0 for and f(x) = — 1/x* for x>0, U(x)=0 identically in Ji x 

and F(as) — 0 for and V (x) — l/x for x>0, we see at once that U and \ are 
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* 8 4. General notion of Integral. We shall deal in this § 
with some notions of a more abstract kind which we shall employ, 
in the next §, as a basis for the constructive definition of the 
Denjoy integrals. 

Let £ be a functional operation by which there corresponds 
to each interval l = [a,b ] a class of functions defined on I, and to 
each function / of this class a finite real number. This class of func- 
tions will be called domain of the operation £ on the interval I, and 
the number associated with / will be denoted by £(/;Z). 

An operation £ will be termed an integral, if the following three 
conditions are fulfilled: 

(i) If a function / belongs to the domain of the operation £ 
on an interval 1 0 , the function belongs also to the domain of £ on 
any interval /(~Z 0 , and c(/; Z) is a continuous additive function 
of the interval IQ1 0 . 

(ii) If a function / belongs to the domain of the operation £ 
on two abutting intervals I, and Z 2 , the function belongs also to 
the domain of £ on the interval I, -f Z 2 . 

(iii) A function / which vanishes identically on an interval I 
belongs to the domain of £ on Z, and we have £(/;/) = 0. 

If £ is an integral, any function / which belongs to the domain 
of £ on an interval I 0 will be termed £-integrable on I 0 and the 
number (T(/;Z 0 ) will be called definite £-integral of / on I 0 . The 
function of an interval I CZ 0 > £(fj Z), which is additive and continuous 
on account of (i), will then be called indefinite £-integral of / on I 0 
and its oscillation on I 0 (i. e. the upper bound of the numbers | £{f\ Z)|, 
where l denotes any subinterval of I 0 ) will be denoted by 0(£; /; Z 0 ). 

Two integrals £, and £., will be termed compatible, if 
Sj(/; I) — £ 2 (f-, I) for every interval I and for every function / 
which is both Cj- and C 2 -integrable on I. 

We shall Say that the integral £ 2 includes the integral 
if the two integrals are compatible and if every function which is 
fj-integrable is also C 2 -integrable. When this is so we shall write 

Given an integral £ and a function g which vanishes out- 
side a bounded set E, it is evident that if g is C-integrable on an 
interval I 0 which contains E in its interior, then g is so also on 
any interval I which contains E, and we have £{g\ I)=£{g\ I 0 ). 
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This fact justifies the following definition: we shall say that 
a function / is C-integrable on a bounded set E, if the function g which 
coincides- with f on E and is 0 elsewhere, is C-integrable on each 
interval ZD-B. The number C(<j;Z) is then independent of the choice 
of the interval 1^)2?; we shall call this number definite C- integral 
of the function / on the set E and we shall denote it by C(/; E). 

Of the known processes of integration, ail those which give rise to a con- 
tinuous indefinite integral (for instance those of Lebesgue, Newton, Denjoy, etc.) 
are easily seen to be integrals according to the above definition. If, however^ 
we wished to include also discontinuous integrals (e. g. that of W. H. Young 
cf. Chap. VII, p. 215) we should have to modify some details of the definition. 

Given a function / on an interval I 0 and given an integral C, 
we shall say that a point ael 0 is a C- singular point of / in Z 0 if there 
exist arbitrarily small intervals I(ZI 0 containing a on each of which 
the function / is not C-integrable. Denoting by S the set of these 
points, we see at once that the set 8 is closed and that the function / 
is C-integrable on every subinterval of Z 0 which contains no points of 8. 

With each integral C we now associate three “generalized” 
integrals C c , C H and C 11 *. defined as follows. 

Given any interval the domain of the operation C ' on 7 0 
is the class of all the functions f which fulfil the following two 
conditions: 

(c 1 ) the set of the C-singular points of / in Z 0 is finite (or empty); 

(c 2 ) there exists a continuous additive function of an interval F 
on Z 0 such that F(I)=C(f- J) whenever 1 is a subinterval of i 0 
which contains no C-singular point of /. 

Since such a function F (if existent) is uniquely determined 
by the conditions (c 1 ) and (c 2 ), we can write C* (/; I 0 )—F(I 0 ). 

The domain of the operation C H on i 0 is defined as the class 
of the functions / which fulfil the following conditions: 

(h 1 ) if S denotes the set of all C-singular points of / in i 0 , the 
function / is C-integrable on the set S and on each of the intervals 7* 
contiguous to the set consisting of the points of S and of the end- 
points of i •; 

(h 2 ) ZW;h)\ < + oo and, in the case in which the sequence (7*) 

k 

is infinite, lim 0(C; /; Z*) = 0. 

k 



256 


CHAPTER VIII. Denjoy integrals. 


For any such function /, we write by definition: 

£ H (/;!„) =£*</; !*) + £(/; 8 ). 

k 

Finally, we obtain the definition of the operation £ 11 * by re- 
placing in the definition of the operation <Z" H the condition (h 2 ) by 
the more restrictive condition: 

(h*) 10 (ff;/;J*)< + oo. 

We verify at once that the operations £ c , € n and £ H * all 
fulfil the conditions (i), (ii) and (iii), p. 254. These operations are there- 
fore integrals according to the definition, p.254, and we evidently 
have and S’C^ H *C^ H * For brevity, we shall write C ( H and 

£ ch * in place of (C c ) H and respectively. 

The integral £ c and the integrals c H and c H * may be regarded respectively 
as the Cauchy and the Harnack generalizations of the integral a. They correspond, 
in fact, to the classical processes employed by Cauchy and Harnack to extend 
integration from bounded to unbounded functions of certain classes. The original 
process of Harnack actually corresponds to the operation <T H + rather than to the 
operation Cf. A. Harnack [1], E. W. Hobson [I, Chap. VIII] and A. Rosen- 
thal [I, p. 1053]. 

If we were to add to the conditions (h 1 ) and (h 2 ) which characterize the 

generalized integral £ H , the condition that lim 0(cT; /;/*)/(>(£, I*) — 0 for almost 

k 

all x € S, we should arrive at a generalized integral C n intermediate between 
c H and C H *. By applying the process C !i in the constructive definitions of Den- 
joy integrals of the next §, we should then obtain an integral v >/ , intermediate 
between L d and Its descriptive definition is very simple: a function f is l ?'- 
integrable if it is $-integrable and if its indefinite integral is almost everywhere 
derivable (in the ordinary sense). This integral has been discussed by A. Khin* 
tchine fl]; cf. also J. C. Burkill [1]. 

# g 5. Constructive definition of the Denjoy integrals* 

With the notation of the preceding §, we see at once that for each 
integral £C®> we have also £ G C®5 similarly the relation cC®* 
implies £ c C®** It is not quite so obvious that the relations £C® 
and £C®* imply respectively £ H C® an( l £ ri *C®*- This last as- 
sertion is a consequence of the following theorem which is anal- 
ogous to Lemma 3.4. 
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(5.1) Theorem. Let Q be a bounded closed set with the bounds 
a and 6, and let {I k } be the sequence of intervals contiguous to 
and suppose that f is a function 9-integrable on the set Q as well as 
on each of the intervals I*, and that (in the case in which the sequence 
{Ik} is infinite) 

%\(3) J f dx\<+ oo and Um 0(0; /; !*) = (). 

* * 


Then the function f is $-integrable on the whole interval 
I=[a,b ] and we have 

(5.2) (3)ffdx=(3)ffdx + Z(3)ffdx. 

I Q *4 

If we suppose, further, that the function f is 3 n -integrable on Q 
as well as on each of the intervals I k and that ^0(9*; f; I k )< + oo, 
then the function f is 3^-integrable on 1. k 

Proof. We shall prove the theorem for the 9-integral. The 
case of the 9, -integral is similar. 

Let I(x) denote the interval [a,x] where we suppose xe[a,J], 
and let 

(5.3) F(x)=Z(3) ff(t)dt. 

* /*•/<*> 


We shall show that the function F, thus defined, is ACG on 
the interval I. For this purpose, it will suffice to show that F is 
AC on the set Q, the function being evidently continuous on I and 
ACG on each of the intervals /*. 


Let g(x) be the function equal to 0 for xeQ and to -(3)1 f(t)dt 
for xel% where k— 1,2, The function g is summable 


‘k 

on I 


and 


if 0(x)— f g(t) dt, the function F clearly coincides with O on Q; 

a 

F is thus AC on Q and therefore ACG on I. 

This being so, we have F' ti .,(x)=0'(x) = g(x)=0 at almost all 
points x of Q, while it follows at once from (5.3) that FL f (x)—f(x) at 
almost all points x of I — Q. Hence, F being ACG on I, it follows 
that the function equal to / on I — Q and to 0 on Q has F for an 
indefinite 9-integral. On the other hand, the function equal to f 
on Q and to 0 elsewhere is, by hypothesis, 9-integrable on I. 
It follows that the function / itself is 9-integrable on I, and 
that (9) f f(x)dx=F{b)—F(a) + (3) jfdx, which, on account of (5.3), 
i Q 

is equivalent to (5.2). This completes the proof. 
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We now pass on to the constructive definition of the Denjoy 
integrals. We begin by introducing the following notation. 

Let {£•} be a sequence of integrals, in general transfinite, such 
that £'(Z &' 1 whenever £<»?. We then denote by the operation € 

£<a s 

whose domain on each interval I is the sum of the domains of the 
operations e 5 for £<a, and which is defined for every function / 
of its domain by the relation £(/; 7)=£-°(/-, 7), where | 0 is the least 
of the indices f< a such that / is ST'-integrable on 7. It then follows, 
of course, that €(/; 7)— £*(/; I) for every f >£ 0 , since by hypothesis C 1 
then includes £■". 

This being so, let {£-} and be two transfinite sequences 
defined, by an induction starting with the Lebesgue integral £, as 
follows: 

T° = £0 _ £f 

£ a = (2£?) CH and X*“ = (^7 i7|) CH * for a>0. 

$< a £<a 


Denoting by Q the smallest ordinal number of the third class 
(cf. for instance, W. Sierpiriski [I, p. 235]) we shall show that 


{."Q 




and s'* = Z £l = £, 

l-fi 


We shall restrict ourselves to the case of the ^-integral (that of 
the ^-integral being quite similar). 

Since £(Z we find at once by induction (cf. above, p. 256) 

that for every f, £-(Z and so, obviously, X £~ C In order to 

£• a 

change this last relation into one of identity, it is enough to show 
that every function / which is 0-integrable on an interval 7 0 =[a,fc], 
is i’Mntegrable on Z 0 for some index £<Q. 

Let denote the set of the -^-singular points of / in 7 0 . The 
sequence ($ ; ), as a descending sequence of closed sets, is sta- 
tionary, i. e. there exists an index v<Q such that — S v+X - 
(For if not, there would exist for every £ < Q a point 
x^eS*— <S rl+1 , and therefore also an interval Zt with rational end- 
points, containing the point x: of S’ but without points in common 
with the closed set 8* + \ nor therefore, with any of the sets 
$’ +2 , S ’ +3 , ... . We should thus obtain a transfinite sequence of type 
Q of distinct intervals with rational end-points, and this is impossible.) 
We shall prove that 
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Suppose, if possible, that /S^O. We see at once that the func- 
tion / is £ v -integrable on each interval I C ! 0 which contains no 
points of 8 V . It follows that the function / is (t’ y ) c -integrable, and 
a fortiori r v+1 -integrable, on each interval contiguous to 8". Since 
8 v =8 v+t , it follows, in particular, that the set 8 V contains no iso- 
lated points. 

The function / being, by hypothesis, £>-integrable on I 0 , the 
set 8" (cf. Theorem 1.4) must contain a portion Q such that the 
function / is summable on Q and such that the 3ories of the definite 
£*-integrals of / over the intervals contiguous to Q converges absolu- 
tely. Since A’C (-O^C ^ follows at once that the function / is 

(r v ) < ' H -integrable, i.e. H^'-integrable, on some interval J 0 contain- 
ing Q. But this is clearly impossible, since, in view of the fact that 
the set 8 V has no isolated points, the interval J 0 certainly contains 
points of the set S v —S v+ ' in its interior. 

We thus have /S‘ , = 0, which establishes the r v -integrability of 
/ on I 0 and completes the proof. 

Various definitions, constructive and descriptive, of Den joy integrals will 
be found in the papers mentioned in Chap. VI, p. 207, and Chap. VII, pp. 214-215, 
as well as in the following treatises and memoirs: N. Lusin [I; 4], T. II. fl il- 
debrandt [1], P. Nalli [I], E. Kamke [I], A. Kolmogoroff [2], 
H. Lebesgue [7; II, Chap. X], A. Rosenthal [1] and P. Romanowski [1]. 

For further extensions to functions of two or more variables, see also 
H. Looman [1] and M. Krzyzanski [1]. 



CHAPTER IX. 

Derivates of functions of one or two real variables. 

§ 1. Some elementary theorems. The first part of this 
chapter (§§ 1—10) is devoted to studying the various relations between 
the derivates of a function of a real variable. With the help of the no- 
tion of extreme differentials introduced by Haslam-Jones, certain of 
these relations will subsequently be extended, in the second part of 
the chapter (§§ 11—14), to functions of two variables. 

Accordingly, the term “function” will be restricted in the first 
part of this chapter to mean function of one real variable. 

Before proceeding to the theorems directly connected with the 
Lebesgue theory, we shall establish in this § some elementary results. 

We first observe that a linear set E contains at most a finite num- 
ber, or an enumerable infinity, of points which are isolated on one side at 
least. To fix the ideas, let A be the set of the points of E which are 
isolated points of E on the right. For each integer n, let A„ denote the 
set of the points x of A such that the interval [x,x+l/n] contains no 
point of E other than x. Then it is plain that, for each integer ft, the in- 
terval [ft/n, (ft+ l)/n] can have at most one point in common with A„. 
Hence each set A„ is at most enumerable, and the same is true of 
the set A — £ A„. 

n 

We say that a finite function F assumes at a point x 0 a strict 
maximum if there exists an open interval I containing x 0 such that 
F(x)<F(x 0 ) for every point xe I other than x„. By symmetry we define 
a strict minimum. 
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(1.1) Theorem. Given a finite function of a real variable F , each of 
the following sets is at most enumerable: 

(i) the set of the points at which the function F assumes a strict 
maximum or minimum ; 

(ii) the set of the points x at which 

lim sup F{t) ^lim sup F(t) or liminf F(t)< lim inf F(t)', 

t-+X t+x+ t-+x t-+x + 

(iii) the set of the points x at which 

F^(x)<F~(x) or F~(x)<F+(x). 

Proof, re (i). Consider the set A of the points at which, for in- 
stance, the function F assumes a strict maximum, and let A n denote, 
for each positive integer n, the set of the points x such that F{t)<F(x) 
holds for each point t^x of the interval (x — 1/n, #+l/n). We see at 
once that each set A n is isolated, and therefore at most enumerable. 
Since A=2jA n , it follows that the set A is at most enumerable. 

n 

re (ii). Let us consider, for definiteness, the set B of the points 
x at which lim sup F(t)>\im sup F(t). We denote, for each pair of in- 

t~+x t-+x~\- 

tegers p and q , by B p>q the set of the points x such that 
lim sup F(t) > p/q > lim sup F(t). 

t-+ X /->J c+ 

Clearly each point of a set B p>q is, for that set, an isolated point on 
the right. Each of the sets B PtQ is thus at most enumerable, and, since 

B—2jB pqj the same is true of the whole set B. 

p,q _ 

re (iii). Consider the set C of the points x at which F+(x)<JT~(x). 
and denote, for each pair of integers q> 0 and p, by C Ptq the set of the 
points x at which F+(x)<p!q<F r ~{x). Write F p>q (x)—F(x)—pxlq. We 
find F PfQ (x)<0<F^ q (x) at each point xtC Piq , and this shows that the 
function F p , q assumes a strict maximum at each point of C p , q . By the 
result just established, each set C PyQ is at most enumerable, and con- 
sequently, the same is true of the whole set G. 

It is sometimes convenient (vide, below, § 5) to appeal to a 
slightly more general form of the last part of Theorem 1.1, which 
concerns relative derivates (cf. Chap. IV, p. 108) and which reads thus: 

(1.2) Theorem . If U and F are two finite functions of a real variable , 

the set of the points t at which the derivative (finite or infinite) 

exists and at which Fu(t)<Fu(t ), is at most enumerable . 



262 


CHAPTER IX. Derivates of functions of one or two real variables. 


This is proved in the same way as the corresponding part of 
Theoreml.l. In fact, if we denote, for every pair of integers q > 0 and p, 
by Cp mll the set of the points x at which Fu(x)<p/q<Fu(x) 9 
we sec at once that the function F(x)-—(p/q)-U(x) assumes at each 
point of <' pq a strict maximum. Therefore each set C M is at most 
^numerable. 

For Theorem 1.1 and its various generalizations, vide: A. Den joy [1, p. 147], 
B Levi [ 1 ]. A. Rosenthal [1], A. Sell on flies [I, p. 158], W. Sierpiriski [1 ; 2] 
and G ( Y oung [ 1]. As regards the enumerability of the set of the points at which 
the function assumes a strict maximum or minimum, it is easily seen that this 
reMilt remains valid for functions in any separable metrical space (cf. F. Haus- 
dorff ^L p. 3ti3]). Mention should be made also of the elegant generalizations of 
Theorem 1.1, obtained successively by Il.Blumberg [1], M. Schmeiser[l] and 
V. Jarnik [3]. 

§ 2. Contingent of a set. We have mentioned earlier (in 
Chapter IV, p. 133), that certain theorems on derivates of functions 
may be stated as propositions concerning metrical properties of 
sets in Euclidean spaces. In connection with these results, we shall 
state in this § some definitions which begin with some well-known 
notions of Analytical Geometry. 

Uy the direction of a half-line l in a space M m (where m^2) we 
shall mean the system of the m direction cosines of l. The half-line 
issuing from a point a and having the direction 0 will be denoted by 
aft. The half-line issuing from a point a and containing a point bj=a 

will be denoted by ah. 

If w’e interpret the system of the m direction cosines of a half- 
line as a point in Il m (situated on the surface of a unit sphere), we may 
regard the set of all directions in a Euclidean space as a complete, 
separable, metrical space (cf. ('hap. II, § 2). It is then clear what is 
to be understood by the terms: convergence and limit of a se- 
quence of directions, everywhere dense set of directions, etc. We 
shall say further that a sequence of half-lines {/„} issuing from the 
same point a converge x to a half -line l issuing from a, if the sequence 
of the directions of the half-lines l„ converges to the direction of i. 

Given a set E in a space Ii mi a half-line l issuing from a point aeE 
will be called an intermediate half -tangent of E at a, if there exists 
a sequence j a n ) of points of E distinct from a, converging to a and such 

— ► 

that t lie* sequence of half -lines {aa n ) converges to l. The set of all inter- 
mediate half-tangents of a set E at a point a is termed, following 
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G. Bouligand [I], the contingent of E at a and denoted by eontg/s-a 
(by the contingent of E at an isolated point of E, we shall understand 
the empty set). A straight line passing through a which is formed of 
two intermediate half-tangents of E at a is called intermediate tangent 
of E at a. Similarly a hyperplane h passing through the point a is called 
intermediate tangent hyperplane of E at a, if each half-line issuing from 
a and situated in h is an intermediate half-tangent of E at a. In R 2 
the notions of intermediate tangent hyperplane and inter- 
mediate tangent are plainly equivalent. 

Given in the space R m a hyperplane h, o 1 a , 1 + rt 2 x 2 -)-...-f a„,x m =b, 
(cf. Chapter III, §2) the two half-spaces (half -planes if m = 2) 
a 1 x 1 +a i x 2 +...-\-a n ,x m ^b and a x x x -f a^ 2 a m x m ^b, into which h 
divides R m , will be, termed sides of the hyperplane h. In the case in 
which h is an intermediate tangent hyperplane of a set E at a point a 
and in which, further, the contingent contgfo is wholly situated on 
one side of h, the side opposite to the latter is called empty side of h 
and the hyperplane h is termed extreme tangent hyperplane of E at a. 
The two sides of h may, of course, both be empty at the same time, 
and this occurs if the contingent of E at a coincides with the set of 
all half-lines issuing from a which lie in the hyperplane h itself. 
The hyperplane h is then termed unique tangent hyperplane, or simply, 
tangent hyperplane, of E at a. 

For simplicity of wording, wc shall restrict ourselves in the sequel 
to the case of sets situated either in the plane R 2 or in the space R 3 . 
Needless to say, the extension to any space R m presents no essential 
difficulty (an elegant statement, which sums up the results of §§ 3 
and 13 of this chapter and which is valid for an arbitrary space R m , 
will be found in the note of F. Roger [2]). 

As usual, the hyperplanes in R 2 and R 3 are termed straight 
lines and planes respectively. Moreover, in the case of plane sets we 
shall speak of tangent (intermediate, extreme, unique) in place of 
tangent straight line (intermediate, extreme, unique). 

We shall discuss the case of the plane (§ 3) and that of the space 
(§ 13) separately, although the proofs of the fundamental theorems 
3.6 and 13.7 which correspond to these two cases, are wholly analogous. 
The proof of the former is, however, more elementary, whereas the 
latter requires some subsidiary considerations connected with the 
notion of total differential (of. below § 12). 
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§ 3. Fundamental theorems on the contingents of plane 
sets. For brevity, we shall say that the contingent of a plane set 
E at a point a is the whole plane , if it includes all half-lines issuing 
from this point. Similarly the contingent of E at a point a will be 
said to be a half plane, if E has at this point an extreme tangent l 
and if contg consists of all the half-lines issuing from a and 
situated on one side of l. 

We shall see in this § that, given any plane set E, at each point a of E except 
at most in a subset of zero length, either 1° the contingent of Pis the whole plane, or 
2° it is a half-plane, or finally 3° the set E has a unique tangent. This result (together 
with the more precise result contained in Theorem 3.6) was first stated by 
A. Kolinogoroff and J. Vercen ko [1;2], It was rediscovered independently, and 
generalized to sets situated in any space H m , by F.Roger [2]. The proofs, together 
with some interesting applications of the theorem of Kolinogoroff and Verienko, 
will be found in the notes of U. S. H aslam-J ones [2;3]. (For the first part 
of Theorem 3.6 cf. also A. S. Besicovitch [4].) 

A finite function of a real variable F, defined on a linear set E , 
is said to fulfil the Lipschitz condition on E , if there exists a finite 
number X such that \F(x 2 )— ■F(a? 1 )|<iV'-|a? 2 ~-a? 1 | whenever x x and x 2 
are points of E. As we verify at once, we then have A {B(F; ./?)}< 
<( A T -f l)-|Fj (for the notation, cf. Chap. II, § 8, and Chap. Ill, § 10). 
Thus, if a function F fulfils the Lipschitz condition on a set E of finite 
[zero] outer measure , its graph B (F;E) on E is of finite [zero] 
length . 

It is also easy to see that any function which fulfils the Lipschitz 
condition on a linear set E , can be continued outside E so as to fulfil 
the Lipschitz condition on the whole straight line Ifj and so as to be 
linear on each interval contiguous to E. 

(3.1) Lemma . Let R be a plane set , 6 a fixed direction and P the set 
of the points a of R at which contg* a contains no half dine of direction 6. 
Then (i) the set P is the sum of a sequence of sets of finite length , 
and (ii) at each point a of P, except at most at those of a subset of length 
zero , the set R has an extreme tangent stich that the side of the tangent 
containing the half-line ad is its empty side . 

In the particular case in which 6 is the direction of the positive semi- 
axis of y, the set P is expressible as the sum of an enumerable infinity of 
sets each of which is the graph of a function on a set on which the 
function fulfils the Lipschitz condition . 
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Proof. By changing, if necessary, the coordinate system, we 
may suppose in both parts of the theorem that 6 is the direction of 
the. positive semi-axis of y. Let us denote, for every positive inte- 
ger n, by P„ the set of the points (x, y) of P such that the inequali- 
ties \x' — x\^lfn and \y' — imply y' — yCn-\x' — for every 
point ( x',y ') of R. Since there is no point a of P at which the contingent 
of R contains the half-line with the direction of the positive semi-axis 
of y, it is clear that P=£P n - Let us now express each P„ as the sum of 

n 

a sequence {P„,k)i> 1.2, of sets with diameters less than 1 /m. We shall 
then have |»/ 2 — -j?,! for every pair of points, (ir,, y { ) and 
(x 2 , y 2 ), belonging to the same set P„,*. Let Q„, k be the orthogonal 
projection of P„,* on the axis of x. We easily see that each point of 
Q„, k is the projection of a single point of P,,,*. Consequently, the set 
P„,b may be considered as the graph of a function P,,.* on <?„,*. Moreover 
we have \F„ ik (xz)~ P n ,*(.ri)[<n-|j'- 2 — Xi| for each pair of points and 
x 2 of Q„ ik , i. e. the function P„ t * fulfils the Lipsehitz condition on Q„ ik 
and therefore (cf. above p. 264) each set P„, k —B(F„y Q„_ k ) is of 
finite length. Thus, since P= Vp,,*, we obtain the required expres- 

n,k 

sion of the set P as the sum of an at most enumerable infinity of sets 
of finite length, which are at the same time graphs of functions 
fulfilling the Lipsehitz condition on sets situated on the .r-axis. 

It remains to examine the existence of an extreme tangent to 
the set R at the points of P. For this purpose, let us keep fixed for the 
moment a pair of positive integers w and k, and let Q„ ik be the set of 
the points of Q„ tk wliich are points of outer density for Q„ tk and at 
which the function F„, k is derivable with respect to the set Q„, k . Since 
the set Qn.k—Qn.k is of measure zero (cf. Theorem 4.4, Chap. VII) and 
since the function F n , k fulfils the Lipsehitz condition on Q n , k , it follows 
that A\B( F n y, Q„, k -Q„, k )] = 0. 

We need, therefore, only prove that R has an extreme tangent 
at each point of the set B (F„y, Q„ lk ) and that, further, the side of 
this tangent which contains a. half-line in the direction of the 
positive semi-axis of y is its empty side. 

Let (f 0 , rj 0 ) be any point of B(F„, *;<?„,*), and the derivative 
of F„, k at | 0 with respect to the set Q„, k . Let e be a positive number le§s 
than 1. Since f 0 is a point of outer density for the set <?„,*, we can 
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associate with each point (f, rj), sufficiently close to (£ 0 , r) 0 ), a point 
S’eQn.k such that 

(3.2) and (3.3) 

(for otherwise, the outer lower density of Qnj, at f 0 would not ex- 
ceed 1 — e). 

Remembering now that y 0 =F n A (f 0 ), let us write for brevity 
*>jLn=*jLr)-ti o-Ao-(r-fo). 

We shall have 

(3.4) ^- % -A 0 (^-f 0 )=2> nit (f') + [»?- ^.*(f')] + il 0 -(f'-f). 

Now suppose that the point (f, rj) belongs to R and that 
|f— f 0 |<l,/2w 2 and \rj— rj 0 \^.l/2n 2 . By (3.3), we have |f'— £|<l/n, 
and, by (3.2), !-!%(?')— J? 0 | < £ 0 |<l/2«, so that 

\F„At')— 1/m- Since the point (f',F„. *(£')) belongs to P n ,kCP», it 
follows from the definition of the set P„ that y — P„, *(£')<«• 
and using (3.3) again, we derive, from (3.4) that 

i?-ij 0 -A 0 .(f-foKiD M (ni+(»+i^j)-if'-fi< 

Now as f, and therefore £', tends to f 0 , the ratio /)„,*(£')/(£'— f 0 ) 
tends to zero; the same is therefore true, on account of (3.3), of the ratio 
-On, *(£')/(£ — £<>)• Consequently, since e is an arbitrary positive number 
less than 1, it follows from (3.5) that the upper limit of the ratio 
lV-%— SJ» as tbe P° int tends to 

is non-positive. Further, since the line y— r] 0 =A 0 -(x— f 0 ) is plainly 
an intermediate tangent of the set B(P„,*; Q n ,k)(ZR at the point (£ 0 , »? 0 ), 
we see that this line is an extreme tangent of the set R at this point 
and that the half-plane y—Tj 0 ^A 0 (x — f 0 ), which contains the half-line 
issuing from (f 0 , %) in the direction of the positive semi-axis of y, 
is an empty side of this tangent. 

This completes the proof. 

(3.6) Theorem. Given a plane set R, le* P be a subset of R at no point 
of which the contingent of R is the whole plane. Then (i) the set P is the 
sum of an enumerable infinity of sets of finite length and (ii) at every 
point of P, except at those of a set of length zero, either the set R has a unique 
tangent or else the contingent of R is a half-plane. 



Fundamental theorems on the contingents of plane sets. 


267 


[§ 3 ] 


Proof. Let (0„} be an everywhere dense sequence of directions 
in the plane and, for each positive integer n, let P„ denote the set of 
the points of P at which the contingent of R does not contain the half- 
line of direction 0„. We clearly have P=%P n , and by the preceding 

n 

lemma each set P„, and therefore the whole set P, is the sum of a se- 
quence of sets of finite length. Further, the same lemma shows that 
the set R has an extreme tangent at every point of P, except at most 
in a set of length zero. 

Now let Q be the set of the points of P at which 1° there exists 
an extreme tangent which is not a unique tangent and 2° the con- 
tingent of R is not a half -plane. For each positive integer n, let Q„ 
denote the set of the points b of Q such that the half-line b0„ is situated 
on the non-empty side of the extreme tangent of R at b, but does 
not belong to contg w b. Plainly Q=YQ„. Now, by the preceding 

n 

lemma, for every point beQ n , except at most those of a set of length 
zero, the half-line bO n is situated on the empty side of the extreme 
tangent at b. It follows that all the sets Q n , and therefore also the 
whole set Q, are of length zero. Hence, at every point of P, except 
perhaps those of a subset of length zero, either there is a unique 
tangent or the contingent at this point is a half-plane. 

(3.7) Theorem. Given a plane set R, let P be a subset of R at every 
point of which the set R has an extreme tangent parallel to a fixed 
straight line I). Then the orthogonal projection of P on the line at right 
angles to D is of linear measure zero. 

Proof. We may clearly assume that the line D coincides with 
the axis of x. Let S and T denote, respectively, the sets of the points 
(f, rj) of P for which the half-planes and y^g are respectively 
the empty sides of the extreme tangents. Consider the former of 
these sets. By Lemma 3.1, the set 8 is the sum of a sequence of 
the sets B(F n ;Q n ), where the Q n are sets on the .r-axis and the F„ 
functions fulfilling the Lipschitz condition on these sets, respectively. 
We may suppose (of. p. 264) that each function P n is defined, and 
fulfils the Lipschitz condition, on the whole x-axis and is linear 
on the intervals contiguous to the set Q n . 

This being so, we easily see that, for every », the relation 
P,7(x)>0 (x) holds at each point x of Q„ which is not an isolated 
point on any side for Q n , i.e. (of. §1, p. 260) at all the points of Q „ , 
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except at most those of an enumerable set. Thus by Lemma 6.3, 
Chap. VII, we have |F n [(J n ]| = 0 for every positive integer n, and 
since the projection of aS on the y-axis coincides with the sum of 
the sets this projection is itself of measure zero. By sym- 

metry, the same is true of the projection of the set T on the 
y-axis, and this completes the proof. 

As an immediate corollary, we derive from Theorem 3.6 the following prop- 
osition : 


(3.8) Given a plane set R , let P be a subset of R at each point a of which there exists 
a straight line through a which contains no half -line of contg^a. Then the set R has 
a unique tangent at all the points of P except at most those of a subset of length zero. 

This result can be easily extended to the space (cf. F. Roger [2]) as follows: 


(3.9) Given a set R in the space let P be a subset of R at each point a of which there 
exists a plane through a which contains no half -line of contg^a. Then (i) the set P is 
the sum of an enumerable infinity of sets of finite length and (ii) the set R has a unique 
tangent at all the points of P except at most those of a subset of length zero. 


Proof. Let be an everywhere dense sequence of directions in the space 
For each positive integer h , let P n h denote the set of the points a of P such 
— ► 

that |cos(o6, #„)!>! fh for every point b of R distant less than 1 fh from o. We 
express each set P n h as the sum of a sequence { P n h k ) kz j 2 of sets of diameter 
less than 1/A. We then have 


(3.10) 


P= 


?P nh =Zv 

— r n.h -T-' 
n,h n,h,k 


n,h,k’ 


Keeping, for the moment, the indices n. A, k fixed, we choose a new system of 
rectangular coordinates, taking for the positive semi-axis of z the half-line of 
direction H n . Let a, /? and y be, respectively, the three positive semi-axes of the new 
coordinate-system. For any set, or any point, Q, we denote by Q^ a \ and the 
orthogonal projections of Q on the planes fly, ya and aft, normal to the axes a, p 
and y respectively. 

We have |cos(afr, y)|> 1/A whenever aeP nhk , beR and 0<e(a, 6)< 1/A. It 
follows at once that there is no point P^*\ k at which the contingent of the plane set 
R contains a half-line at right-angles to the semi-axis y. Hence, by (3.8), 
the set P^ k is the sum of an at most enumerable infinity of sets of finite 
length, and the set R^ u) has a unique tangent at all the points of P^\ k except at most 
those of a set M n h k of length zero. Similarly, the set R^ has a unique tangent at 
all the points of P^\ k except at most those of a set N n hk of length zero. It follows 
that the set R has a unique tangent at each point a of P n h k , except perhaps when 
or when e N n h k . Now we easily see that the two ratios 

Q{a, b)/(>(a {rt \ 6 (ft) ) and $(a, b)/e(a^\ b (f<?) ) remain bounded (by A) when a and b 
belong to the set P n h h . It follows that the set of the exceptional points of P n h k 
at which the set R has no unique tangent is, with the sets M n h k and N n h k , of 
length zero. For the same reason, since the set P{“J k is the sum of an at most 
enumerable infinity of sets of finite length, so is also the set P„ This com- 
pletes the proof, on account of the relation (3.10). 
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§ 4. Denjoy’s theorems. We shall apply the results of the 
preceding § to establish certain important relations, valid almost every- 
where, which connect the Dini derivates of any function what- 
soever, and which are known by the name of the Denjoy relations. 
For simplicity of wording, we agree to call opposite derivates of 
a function F at a point x 0 the Dini derivates F+(x 0 ) and jF~(# 0 ), 
or else F + (x 0 ) and F (# 0 )- 

We shall begin with some preliminary remarks. 

Let F be a finite function defined in a neighbourhood J of 
a point x Q and let B denote the graph of F on J. It is clear that if 
the function F is derivable at the point x 01 the set B has at (x 0 ,2' 7 (#<>)) 
a unique tangent not parallel to the axis of y. Similarly, if two op- 
posite derivates of F are finite and equal at x 0J the set B has at 
(a? 0 ,F(:r 0 )) an extreme tangent y —F(x 0 )—k(x—-x 0 ), whose angular 
coefficient k is equal to the common value of these derivates. Con- 
versely, if at the point (# 0 ,F(# 0 )) the set B has the extreme tangent 
y —F(x 0 ) = k(x — .r 0 ) where /c=j=oo, then 1° F+(x 0 )=F~~(x 0 ) = k in the 
case in which the half-plane y—y 0 ^k (x—x {) ) is an empty side of 
this tangent and lim mpF(x)^F(a* 0 ) f and 2° F { (x 0 )=F (x 0 )~k in 

x->x u 

the case in which the half-plane y — y 0 ^.k(x — x 0 ) is an empty side 
and lim inf F(x)^F(x 0 ). 

X->X n 

In the enunciations of the theorems which follow, we shall 
frequently be concerned with exceptional sets E, connected with 
a function F and subject to the condition A {B(F; E)}=^0. This 
condition evidently implies both |2£| = 0 and = since the 

sets E and F[E] are merely the orthogonal projections of the set 
B(F;E) on the x - and y-axes, respectively. 

(4.1) Theorem. If at each point of a set E one of the extreme uni- 
lateral derivates of a function F is finite , this derivate is equal to its 
opposite derivate at every point of E except perhaps at the points of 
a set E x of measure zero such that A{B(FyE 1 )}—0. 

Proof. We may clearly suppose that the same derivate, F+(x) say. 
is the one which is finite throughout E. We thus have lim sup F(x) < F(x 0 ) 

x-+x 0 + 

at every point x 0 eE and, on account of Theorem 1.1 (ii), we may 
even suppose that lim sup F(x) ^.F {x 0 ) at every point x 0 of E. 

Now, when x 0 eE, the contingent of B (F',E) at the point {x 0 ,F{x 0 )) 
contains no half-line situated in the half-plane x^x 0 and having 
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angular coefficient exceeding F+(x 0 ). Therefore, by Theorem 3.6, 
the set B(FjE) has an extreme tangent at each of its points (x 0 ,F(x 0 )), 
except for those of a subset B x of length zero, and this tangent has the 
half-plane y—y Q ^F+(x 0 )-(x — x 0 ) for its empty side. Hence, denot- 
ing by E x the orthogonal projection of B x on thea?-axis, we see, from 
the remarks made at the beginning of this §, that at every point x 
of the set E — E x the derivates F + (x) and F ~(#) are equal. This com- 
pletes the proof since A{B(F;E 1 ))=A(B 1 ) = 0. 

(4.2) Theorem . If at each point of a set E a finite function F has 
either two finite Dini derivates on the same side, or else a finite extreme 
bilateral derivate (F(x) or F(x )), then the function F is almost every- 
where derivable in E ; moreover, denoting by E 0 the set of the points x 
of E at which the function F is not derivable , we have A(B(JP;I? 0 ))^0. 

Proof. It will suffice to consider separately the following 
two cases: 

1° The function F has two Dini derivates on the same 
side finite at each point of E . We then have, by Theorem 4.1, 

(4.3) F + (x)=F~(x) and F + (x)=F~(x) 

at each point x of E, except perhaps those of a set E 0 such that 
A{B(F) E 0 )) — Q. But the relations (4.3) imply the equality of all four 
Dini derivates at the point x, and since two of them are finite, by 
hypothesis, at each point x of E , the function F is derivable through- 
out E — E 0 . 

2° The function F has an extreme bilateral derivate 
finite at each point of E. By applying twice over Theo- 
rem 4.1, and making use of the obvious relations F~ h (x)^F' h (x) and 
F~(x)^F(x), we see that the four Dini derivates are finite and 
equal at each point of E , except perhaps at those of a set on which 
the graph of F is of zero length. This completes the proof. 

Theorem 4.2 (in a slightly less complete form, it is true) has 
already been mentioned in Chap. VII, p. 236, as a corollary of 
Theorems 10.1 and 10.5, Chap. VII. We have also stated that (as 
a consequence of these same theorems) the set of the points at which 
a function has a unique derivative (even a unilateral derivative) 
infinite, is necessarily of measure zero. We can now extend this 
result by taking the modulus, as follows: 

(4.4) Theorem . For any finite function JP, the set of the points x 
at which lim \F(x-\-h) — F(x)\lh=+ oo, is of measure zero. 
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Proof. Denoting the set of the points in question by i, we 
see at once that the graph of the function F has at every point of 
the set B(F;A), except perhaps at those of a set of length zero, 
an extreme tangent parallel to the y- axis. Thus, by Theorem 3.7, 
the set A y which is the projection of the set B(F;A) on the #-axis, 
is of measure zero and this completes the proof. 

It results, in particular, from Theorems 4.1, 4.2 and 4.4 that the Dini 
derivates of any finite function F satisfy one of the following four relations at almost 
every point x: 1° F^~(x)~ F~(x)~ -foo, F^~(x)—F~(x)= — oo; 2°F^(x)~F~{x) ^oo, 

= — oo, F~(x)—- foo, 3° F+ (x)~F~ (x) -\oo ; F+(x)=+ oo, F~(x)= — oo; 
4° F+(x)=F+(x)=F-(x)=F- '(afj + oo. For direct proofs of this theorem, which was 
established first by Denjoy for continuous functions and then generalized to 
arbitrary functions, vide : A. Denjoy [1], G. C. Young [2], F. Riesz [7], J. Rid- 
der [4], J. C. Burkill and U. S. Haslam-Jones [1], and H. Blumberg [2] (cf. 
also A. N. Singh [1]). A further discussion of the Denjoy relations will be found 
in the notes of V. Jarnik [1] (for functions of one variable) and of A. S. Besi- 
covitch [6] and A. J. Ward [4] (for functions of two variables). For Theo- 
rem 4.4 see S. Saks and A. Zygmund [1] (cf. also S. Banach [1]). 

A part of the Denjoy relations has recently been generalized to differential 
coefficients of higher orders; see the important memoirs of A. Denjoy [9], 
J. Marcinkiewicz and A. Zygmund [1], and J. Marcinkiewicz [2]. 

We may now supplement Lemma 6.3, Chap. VII, by the fol- 
lowing result: 

(4.5) Theorem . Let M be a finite number and F a finite function 
such that |F + (a ?)|< M at every point x of a set E . Then \F[E]\ < M-\E\. 

Proof. Let E 1 denote the set of the points x of E at which 
F~(x)=$=F+(x). By Theorem 4.1, we have A{B(F- J E 1 )}=0 and there- 
fore, |J? t [jE 7 1 ]| = 0- On the other hand, since |,F _ (^)| = M at 

each point xeE~-E u it follows from Lemma 6.3, Chap. VII, that 
\F[E— E x ]\ < M-\E\y and this completes the proof. 

An immediate consequence is the following criterion for a func- 
tion to fulfil Lu sin’s condition (N) (Chap. VII, § 6): 

(4.6) Theorem . If a finite function F has at each point x of a set E 
a finite Dini derivatey the function necessarily fulfils the condition 
(N) on E . 

Proof. It is enough to show that if at each point a? of a set H 
of measure zero the function F has one of its Dini derivates, say, 
finite, then |jP[JQT]| = 0. For this purpose, let H n be the set of the 
points xeH at which |F 4 '(a?)|<n. We have, by Theorem 4.5, 
[WJI = 0 for each positive integer », and hence |J?*[jBr3|=0. 
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It is easy to see that the hypotheses of Theorem 4.5 imply that 
A{B(F; E))^(M+ l)-|i£j. This remark enables us to complete Theorem 6.5 of 
Chap. VII, as follows: If the derivate F+ of a finite function F is finite at every point 
of a measurable set E y except at those of an enumerable subset , then the function F , 
together with its derivate F+ t is measurable on E and we have 

\F[E]\^ f\F + (x)\dx and A{B(F; E)}= l{l+[F + (x)Y\' , *dx. 
e k 

We may note also the following consequence of Theorem 4.5: If one of the 
four Dini derivates of a function F vanishes at every point of a set E , then |FfjE/]|=0. 

For functions F(x) which are continuous, or more generally continuous in 
the Darboux sense (i. e. assume in each interval [a, b] all the values between F(a) 
and F(b)), we deduce at once the following result: 

(4.7) Theorem. If F is a finite function , continuous in the Darboux sense on an 
interval I, and if at each point of this interval , except those of an enumerable set , one 
at least of the four Dini derivates is equal to zero , then the function F is constant on I . 


* § 5. Relative derivates. The Den joy relations can be 
extended in various ways to relative derivates of a function 
with respect to another function. Let us remark that, in accordance 
with the definition given in Chap. IV, p. 108, the extreme derivates 
of any function with respect to a finite function U are determined 
at each point which belongs to no interval of constancy of the func- 
tion ?7; consequently, the set of values taken by the function U 
at the points at which the extreme derivates with respect to V 
remain indeterminate is at most enumerable. 

In the sequel it will be useful to employ the notation adopted 
in Chap. IV, § 8. Let us recall in particular, that if C is a curve given 
by the equations x—X(t), y = Y(t ), its graph on a linear set E (i. e. 
the set of the points (X(t),Y(t)) for teE) is denoted by B(<7;JS). 

(5.1) Lemma . If C is a curve given by the equations x— U ( t ), y = F(t ), 
the set E of the points t at which Fa(t)<+oo f may be expressed as 
the sum of a sequence of sets {E n } such that 

(V) for every n and for every open interval I of length less than 1/n, 
the set B (C;I) has a unique tangent at every point of B {C]I-E„) 
except those of a set of length zero . 

Proof. Let us denote, for each positive integer n, by E n the 
set of the points t such that, provided that the differences F(t') — F(t) 
and U(t')— U(t) do not vanish simultaneously, the inequality \V— t\^.l/n 
implies [F(t’)— F(t)] /[ U(t') — U(t)]^.n. We see at once that, for any 
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open interval I of length less than 1/n, the contingent of B(C)I) 
at a point of B(<7;/-27„) cannot contain half-lines of angular coef- 
ficient greater than n , and can be, therefore, neither a whole plane, 
nor a half-plane (cf. §3, p.264). The property (y) of the sequence 
{E„} thus appears as a direct consequence of Theorem 3.6. 

(5.2) Theorem . If U and F are continuous functions and we have 
Fc(t)<+oo at every point t of a set E , then there is a finite derivative 
F'u(t) at each point t of E , except at the points of a set H such that 
|t/[H]| = 0. 

Proof. Let C denote the curve given by the equations x— U (t), 
y = F(t). On account of Lemma 5.1, the set E is expressible as the 
sum of a sequence of sets { E n } which fulfil the condition (y) of this 
lemma. Keeping fixed, for the moment, a positive integer n, let us 
consider an open interval I of length less than 1/n. Let B n (I) denote 
the set of the points of B (C;I-E n ) at which the graph of the curve 
C on I either has no unique tangent, or else has a unique tangent 
parallel to the y-axis. Further, let B n (I) be the projection of the 
set B n (I) on the #-axis. On account of the condition (y) and 
Theorem 3.7, we have |J3„(/)| = 0. Now since U and F are continuous, 
it is clear that the derivative F'u(t) exists and is finite at each point 
teI E n , provided that U (t) does not belong to the set B„(I). Hence, 
I being any open interval of length less than 1/n, this derivative 
exists and is finite at each point teE ni except at most at the points 
of a set H n such that \U[H n ]\~-0. This completes the proof, since 
E=>jE n . 

n 

(5.3) Theorem . If U is a continuous function and F any finite 
function for which Fu{t)~ 0 at each point t of a set E, then |F[£]| = 0. 

Proof. Let C denote, as in the proof of the preceding theorem, 
the curve x — U(t), y=F(t ), and let E be expressed as the sum of 
a sequence of sets {E„) subject to the condition (y) of Lemma 5.1. 
Keeping fixed, for a moment, a positive integer n, let us consider 
any open interval I of length less than 1/n. At each point of B(C; I -i?*), 
except those of a set of length zero, the set B(C;/) then has a unique 
tangent, and since the function U is continuous and Fu(t) = 0 at 
each point teE s this tangent is parallel to the #-axis. It follows, 
by Theorem 3.7, that the set F [/•£„], which coincides with the 
projection of the set B(C;I-E n ) on the y-axis, is of measure zero. 
Since I is any interval of length less than 1/n, it follows that |F[B„]|=0 
for each positive integer n, and finally that |F[U]l = 0. 
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The hypothesis of continuity of the function U is essential for the validity 
of Theorems 5.2 and 5.3 (the hypothesis of continuity of the function F, which is 
not required in Theorem 5.3, may, however, be removed also from Theorem 5.2). 
Let F(t)~ - 1 identically, and let U(t)=t for irrational values and U(t)=t+l for 
rational values of t. Denoting by E the set of irrational points of the interval (0, 1), 
we shall have at each point t of this set 

?V‘)=^(«)-F £/ (t)=0 and FuW^F+tt)=F i; (t)^—l. 

Nevertheless — |F[25]|==1. On the other hand, the hypothesis of continuity 

of the function U may be removed from Theorem 5.3, if we replace the con- 
dition F u (t)^0 by the more restrictive condition F LJ (t)^ 0. To see this, we shall first 
establish an elementary lemma. 

(5.4) Lemma . If U is a finite function on a set E, there exists a set 

TCE such that the set U[T] is at most enumerable and such that each 

point rtE—T is the limit of a sequence of points {£,} of E which 

fulfils the conditions (i) t,> x and U (t,)+ U (t) for each i — 1 , 2 ,... 

and (ii) lim U(t,)=U(r ). 

/ 

Proof. Let T be the set of the points TtE none of which is 

the limit of a sequence {t,} of points of E subject to the conditions 

(i) and (ii) of the lemma. Let us denote, for each positive integer k, 

by T k the set of the points r of T for which there is no point t e E 

such that both Oct— x<\jk and 0<\U(t)—U(T)\<l/k. We have 

T=£T k . Plainly the function IJ cannot, on any portion of T k of 
* 

diameter less than 1 jk, assume two distinct values differing by less 
than 1/A:. It follows that each set U[T k ] is at most enumerable, 
and the same is therefore true of the whole set U[T\. 

(5.5) Theorem. If U and F are any finite functions and Fu(t) — 0 
or, more generally Fu(t)=F~u(t) — 0, at each point t of a net E, then 

m™I=o- 

Proof. Let V be the curve x= U (t), y=F(t), and let E„ denote, 
for each positive integer n, the set of the. points t of E such that 
the inequality OsgP — t^l/n implies \F(t')— F(t)\ < \U (t‘) — U(t)\ 
whatever be the point t'. We can express each set E„ as the sum 
of a sequence of sets of diameter less than 1/m. 

Let us keep n and k fixed for the moment. Tt is clear that 
the contingent of the set B(C; _£?„,*) cannot, at any point of this 
set, contain a half -line whose angular coefficient exceeds the num- 
ber 1. Consequently, denoting by B„, k the set of the points of B (C;E„, k ) 
at which the set B(C-,E„, k ) has no unique tangent, we see from 
Theorem 3.6 that A(B n<k )=Q. 
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Now the set E„, k contains, by Lemma 5.4, a subset T„, k such 
that TJ[T„'k\ is at most enumerable and that each point reE„ fk — T„, k 
is the limit of a sequence {<,•} of points of E„, k which fulfils the con- 
ditions (i) and (ii) of this lemma. Hence, the relations Fu(t)—F(j(t)= 0 
being satisfied, by hypothesis, at each point t e E n , k , the set B (C;E„, k ) 
has a unique tangent parallel to the aj-axis at each point of the 
set B (C-,E„' k —T U ' k ) — B„ ik . Since d(B„,*) = 0, it thus follows from 
Theorem 3.7 that the set F[E nk — T„,*], which coincides with the 
projection of the set B (C-,E„, k —T„, k ) on the y-axis, is of measure zero. 
The same is therefore true of the set F[E „, k ], for the set F[T„ ik ] 
is, with U[T„ tk ], at most enumerable. It follows at once that 
|J’[^]| = 0, since E=Z E„, k . 

n,k 

We may mention an application of Theorems 5.3 and 5.5, which is connected 
with the following theorem of H. Lebesgue [II, p.299]: If the derivative of a con- 
tinuous function F , with respect to a function U of bounded variation, is identically 
zero , then the junction F isaconstant. J. Petrovski [1] and R. Caccioppoli [1] 
extended this theorem, in the case when the function U is continuous, by removing 
the hypothesis of bounded variation for U. At the same time, Petrovski remarked 
that it was sufficient for the validity of the theorem to suppose that the relation 
holds everywhere except in an enumerable set. 

It is easy to see that this result is contained in each of the separate theorems 
5.3 and 5.5. These theorems actually enable us to state the result of Petrovski 
and Caccioppoli in two slightly more general forms. Thus: 

1° Suppose that U and F are continuous functions and that at each point t , 
except at most those of an enumerable set , one at least of the four relations 
F u (t)= 0, F a (t)= 0, J'+«)=J’ + (<)=0 or F u (t)=F (J (t)^0 is fulfilled. Then the 
function F is a constant. 

2° Suppose that U is any finite function and F a continuous function , and let 
one of the relations Fb(t)--=F+(t)=0 or F ( j(t)=F y{t)=Q hold at each point t except 
at most those oj an enumerable set. Then the function F is a constant. 

We observe further that, in both the statements 1° and 2°, we may replace 
the hypothesis of continuity of F by the hypothesis that F is continuous in the 
Darboux sense (cf. §4, p. 272); moreover the condition that the exceptional set be 
at most enumerable may be replaced by the condition that the set of values assumed 
by the function F at the points of this set be of measure zero. 

The Denjoy relations have a more complete extension to rela- 
tive derivates when the function V of Theorem 5.2 is subjected 
to certain restrictions. Thus: 

(5.6) Theorem. Let U and F be finite functions , and suppose that , 
at each point t of a set E y the derivative U'{t) (finite or infinite) exists 
and that Fu(t)<-\- oc. Then Fu(t)=Fu (t) 4=00 at each point t of E 
except at most the points of a set H such that \U[H]\ — 0. 
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Proof. We may clearly restrict ourselves to the case in which 
the derivative U'(t) is non-negative throughout E, and even, by 
Theorem 4.5, to the case in which (1°) U'( t)> 0 at each point t 
of E. We then have limsup U(t)^ (/(rXlim inf U(t) at each point 

t->r — 

Tf E, and consequently, on account of Theorem 1.1 (ii), we may 
suppose that (2°) the function U is continuous at each 
point of E. This implies that we then have also lim supi^(<Xf 1 (r) 

at each point r of E, and hence, appealing again to Theorem 1.1 (ii), 
we may suppose further that (3°) lim m])F(t)^F(r) at each point 

t-+i 

rtE. Finally by Theorem 1.2, we may suppose (4°) Fu(r)^.Fif{r) 
at each point Tf E. 

Let now C be the curve defined by the equations x— U (f), 
y--F(t). We denote, for each positive integer to, by E„ the set 
of the points tf-E such that, for every point t', (i) the inequal- 
ity 0<<' — t<l/n implies the two inequalities U(t')>U(t) and 
F(t') — F(t)<n-[U(t') — U(t)], and (ii) the inequality 0<t—t'<l/n 
implies U(t)>U(t'). 

Since, by hypothesis, Fu(t)<- foo and since, by (1°), Z/'(<)>0 
at each point t of E, we see that E=£E n . 

n 

Keeping a positive integer to fixed for the moment, let I be 
any open interval of length less than 1/n. Whenever (£. rj) is a point 
of B(f;I-Z?„), the contingent of the set B(C’;Z) at (f, t]) clearly con- 
tains no half-line which is situated in the half-plane and which 
has an angular coefficient exceeding to. Let D(I) denote the set 
of the points of the set B(t7;Z) at which this set has an extreme 
tangent, non-parallel to the y-axis, with an empty side containing 
the half-line in the direction of the positive semi-axis of y. Further, 
let B„{ l ) be the set of the points of B((7;Z-F„) which do not belong 
to Z)(Z), and let H„(I) be the projection of B„[I) on the .T-axis. 
By Theorems 3.6 and 3.7, the set B„(I) is of measure zero. 

This being so, let t 0 be any point of the set I-E„ such that 
U(t 0 ) does not belong to the set Let us denote by k 0 the 

angular coefficient of the extreme tangent to the set B (C\l) at 
the point ( U (t 0 ),F(t 0 )). It follows easily from the hypotheses (1°), 
(2°) and (3°) that FJ(t 0 )^k 0 ^Fu(t 0 ),- and tnis, in view of (4°), leads 
to the relation Fu(t 0 )—Fv(t 0 ) 4= o°. 
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Thus, since 1 is any open interval of length less than 1 In, we 
find that the last relation holds at each point t 0 of E„ other than 
those belonging to a set H„ such that 1 17 [#„]] = <). This completes 
the proof, since we have seen that E=£E n . 

n 

In view of Theorem 7.2, Chap. VI I, we derive from Theorem 5.6 the following 
theorem which has been established in a different way by A. J. Ward [3]: 

(5.7) Suppose that the function U is VBG* and let F be any finite function . Let E 
be a set at each point t of which we have either Fy(t)<+oo or Fy(t )> — oo. Then the 
derivates FJJ and F^j are finite and equal at all points of E except at most those of a set 
H such that |I7[£f]|=0. 

It will result from the considerations of § 6 (see, in particular, Theorem 6.2) 
that Theorem 5.7 remains valid for all continuous functions U which fulfil the 
condition (Tj). Nevertheless, its conclusion ceases to hold if we allow U to be any 
function which is VBG or even ACG. To see this, let G be a non-negative continuous 
function which is ACG on the interval [0, 1] and for which G(t)=0 and G~(t)< — 1 
at any point t of a perfect set E of positive measure (for the construction of such 
a function cf. Chap. VII, § 5, p. 224). Let us choose I7(f)=t-f G(t) and F(t)—t. We 
shall then have at every point t of E, U(t)—t , U^(t)^>\ and U~(t)<0<U ( t ), 
so that 0':Fy(t)‘'F^(t) - 1 , while Fjj(t)= — oo and .F~(()^-foo. Nevertheless 
|tf[£]H£|>0- (This example is due to Ward.) 

*$6. The Banach conditions (TJ and (TJ. A finite func- 
tion of a real variable F is said to fulfil the condition (T t ) on an 
interval I if almost every one of its values is assumed at most 
a finite number of times on I. A finite function F is said to fulfil the 
condition (T 2 ) on an interval I if almost every one of its values 
is assumed at most an enumerable infinity of times on I. 

These two conditions were formulated by S. Banach [6]. We shall begin 
by studying the condition (T x ) and we shall establish a differential property which 
is equivalent to this condition in the case when F is continuous ( vide below 
Theorem 6.2). Another equivalent condition, due to Nina Bary, will be estab- 
lished in § 8 (Theorem 8.3). 

(6.1) Lemma . Suppose that F is a continuous function and that E 
is a set at no point of which the function F has a derivative ( finite 
or infinite). Suppose further that each point x of E is an isolated 
point of the set E[J 1 (<) = ^(^)]. Then A{B(jF;1?)} = 0, and conse- 

t 

quently |2?| = \F[E]\ = 0. 
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Proof. For each xe E there exists a neighbourhood I such 
that, when tel, the difference F(t)—F(x) remains of constant sign 
as long as t remains on the same side of x; this difference then changes 
sign as t passes from one side of x to the other, except in the 
case in which the function F assumes a strict maximum or mini- 
mum at x. Therefore, if we denote by E 0 the set of the points at 
which the function assumes a strict maximum or minimum, we 
see at once that the four Dini derivates of F have the same sign 
at any point x of E — E 0 . In other words, since, by hypothesis, the 
function F has no finite or infinite derivative at any point of E, 
we shall have at each point x of E—E 0 either +oo> F(x)^0 or 
else — oo<F(x)^Q. Hence, by Theorem 1.2, A(li(F-, E — E„)} — 0, 
and, since the set E 0 is at most enumerable (cf. Theorem 1.1), it 
follows that A[B(F‘,E)) = Q. 

(6.2) Theorem. In order that a function F which is continuous 
on an interval I, fulfil the condition (T x ) on this interval, it is neces- 
sary and sufficient that the set of the values assinned by F at the points 
at which the function has no derivative (finite or infinite ) be of meas- 
ure zero. 

Proof. Denoting by Y the set of the values assumed an in- 
finity of times by the function F on I, and denoting by E the set 
of the points of I at which F has no derivative, we have to prove 
that the relations Y = 0 and ,F[£]| = 0 are equivalent. 

1° I'=o implies F[E]‘. = 0. Let X be the set of the points 
x*I such that F(x)eY. Then F[X~\=Y, whence |F[Z]| — 0. 

On the other hand, for each x 0 e E — X, the set of the points x 
such that F(x) -F(x 0 ), is finite, and consequently an isolated set. 
It follows from Lemma 6.1 that \F[E — A']] = 0, and hence finally 
that F[E]^ F[X]4-F[E-X]\^0. 

2° ,F[E\ — 0 implies jl'i = 0. Let H denote the set of the 
points x at which F'{ x) =:0. By Theorem 4.5, we have |F[H]| = 0. 

Now let y 0 be any point of Y--F[E], and let E 0 denote the 
set of the points x at which F(x) — y 0 . The set E u being infinite and 
closed, let x 0 be a point of accumulation of Since the function F 
has a derivative at each point of E 0 , w r e find that. F'(x 0 )= 0; thus 
x 0 fH and therefore y 0 f F[H\. It follows that Y —F[E](ZF[H], 
and hence that | F -F[A’]|= 0. Thus iF[£]| = 0 implies |F| = 0 
and the proof is complete. 
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(6.3) Theorem. 1° A continuous function which is VBG, (in par- 
ticular, one of hounded variation) on an interval I, necessarily fulfils 
the condition (T,) on I. 

2° A continuous function which is VBG on an interval I neces- 
sarily fulfils the condition (T 2 ) on I. 

Proof. On account of Theorem 7.2, Chap. VII, the first part 
of the theorem is an immediate corollary of Theorem 6.2. To 
establish 2°, let us suppose that F is continuous and VBG on an in- 
terval I. The interval I is then expressible as the sum of a sequence 
{E„) of closed sets on each of which the function F is VB. We may 
clearly suppose that each E„ contains the end-points of the interval I. 
Let us denote, for each positive integer n, by F„, the function which 
is equal to V on E„ and which is linear on the intervals contiguous 
to E„ The functions F„ are plainly of bounded variation on I, and 
therefore, by 1°, they fulfil the condition (T t ). It follows at once 
that the function F fulfils the condition (T 2 ) on I. 

In the part of Theorem 6.3(1°) that applies to functions which are VBG*, 
the continuity hypothesis for the function F is not a superfluous one (thus, the 
function F(x)— sin( \/x) for x t 0 and F(0)=0 is VBG* and does not fulfil the condi- 
tion (T,) on [0, 1]). This hypothesis may however be replaced by a weaker one, 
which consists in supposing that the function F has no points of discontinuity 
other than of the first kind (i. e. that, at each point x , both the unilateral limits 
F(x-f-) and F(x-) exist). In particular, functions of bounded variation, whether 
continuous or not, all fulfil the condition (Tj) (and from this it follows easily that 
the continuity hypothesis may be removed altogether from the second part (2°) 
of the theorem). 

For functions of bounded variation, the condition (T,) may also be deduced 
from the following general property of plane sets, established by W. Gross [1] 
(cf. J. Gillis [1)): // E is a plane set and E n denotes the set of the values of r t such that 
the line y~t] contains at least n distinct point * of the set E , then .\(Ey? 

In connection with part 2° of Theorem 6.3, it may be noted further that 
functions which are VBG, or even ACG, need not fulfil the condition (T,). An 
example is furnished by the function V considered in §5, p. 277. The latter is also, 
as will follow from results to be established in § 7 (cf. in particular. Theorem 7.4), 
an example of a continuous function which is ACG, and consequently fulfils the 
condition (N), without fulfilling the condition (S) of Banach. 

For continuous functions of bounded variation, the condition 
(TJ is also a consequence of the following theorem of S. Banach [5] 
(cf. also N. Bary [3, p. 631]), which contains at the same time an 
important criterion for a continuous function to be of bounded 
variation: 
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(6.4) Theorem. Let F be a continuous function on an interval 
/ 0 =[a,6] and let s(y) denote for each y the number (finite or infinite) 
of the points of 1 0 at which F assumes the value y. Then the function 
s(y) is measurable (33) and we have 

■foe 

(6.5) fs(ij)dy=W(F-,I 0 ). 

— oo 

Pro of. For each positive integer »,let us put /i n) =[a,a+(6— o)/2 n ] 
and I { M n) =(a+(k-l)(b-a)!2",a+k(b-a)l2"], when *=2,3,. ..,2". 
This defines a subdivision 3 < " > of the interval I 0 into 2" subintervals, 
of which the first is closed and the others are half-open on the 
left. For * = 1,2, ...,2", let $*'' denote the characteristic function of 

the set F [/*"’], and let s M (y)=^s^\y). 

k \ 

We see at once that the functions s in) {y) constitute a non* 
decreasing sequence which converges at each point y to s(y). Hence, 
the functions s {n) (y) being clearly measurable (*33), so is also the func- 
tion s(y). +oo 

On the other hand, /V*" ) (i/)dv = |F[7 < * ) ]! =0(F;I$," ) ). Therefore, 

oo 

denoting by \V {n) the sum of the oscillations of the function F on the 

- 1-00 

intervals of the subdivision 3 <n) > we obtain j 8 {,, \y) = W ( " , and the 

oo 

relation (6.5) follows by making n-+ oo. 

(6.6) Theorem . If F(x) is a continuous function which fulfils the 
condition (T 2 ) on an interval / 0 , the set I) of the points at which the 
derivative F'(x) (finite or infinite) exists , is non enumcrably infinite. 

Moreover , if we write 

P — E[x e D\ F\x)^ 0] and N=E[xeD] F'[x)^ 0], 

-r x 

then, for each interval I — [a,b] QI 0 , we have 
(«.7) -|F[,VKF(i)-F(aK|F[Pl|. 

Proof. We may, plainly, suppose that 
(«•«) F(a)^F(b), 

since the other case may be discussed by changing the sign of the 
function F. 

Let Y be the set of those values of F on I which are assumed 
by the function F at most an enumerable number of times on I. 
Denoting, for each y, by E u the set of the points xel such that 
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F{x) — y, we shall show that with each point ye Y we can associate 
a point x y eE y , in such a manner that (i) F(x y )^0 and (ii) x y is 
an isolated point of the set E y . 

For this purpose, we remark first that if the set E y reduces 
to a single point, the latter may be chosen for our x y . For, in that 
case, the condition (ii) is clearly fulfilled, while the condition (i) 
holds on account of the hypothesis (6.8). 

Let us therefore consider the other case, in which the set E,, 
contains more than one point. Then, since the function F is con- 
tinuous by hypothesis and yeY, the set E y is closed, and at 
most enumerable. This set, therefore, contains a pair of isolated 
points a, p between which it has no further points. (This is obvious, 
if the set E„ is finite. If E y is infinite, its derived set (of. Chap. II, 
p. 40) is itself closed, non-empty, and at most enumerable; the 
latter, therefore, contains an isolated point x 0 . Thus near x 0 there 
are only isolated points of E y . It will, therefore, suffice to choose, 
among the latter, any two consecutive points as our points a and /?.) 
Consequently, at one at least of the points a and ft, the upper deri- 
vate of F(x) is non-negative. We choose this point as our x y . We 
then see at once that the conditions (i) and (ii) are fulfilled. 

This being established, let X denote the set of all the points 
x y which are thus associated with the points y ^ Y. It follows 
from the conditions (i) and (ii) and from Lemma 6.1, that 
\F[X — P]| = |F[AT — D]| = 0, and so, by definition of the set A’, 
that |Y|HF[X]| = |F[X-P]|<|F[P]. Since the condition (T 2 ) im- 
plies that |F[/]| = |Y|, we obtain, in view of (6.8), the inequality 
-|P[^]|<0^P(ft)-P(aK|P[Z]|<|P[P]|, i.e. the inequality (6.7). 

Finally, since this relation holds for every subinterval [ a,b ] 
of Jg, we see that, unless the function F is a constant, one at least 
of the sets F[N] and P[P] is of positive measure. The set D—X+P 
is thus non-enumerably infinite, and this completes the proof. 

(6.9) Theorem . Let F be a continuous function which fulfils the 
condition (T 2 ) and let g be a finite summable function . Suppose further 
that F'(x)^g(x) at each point x at which the derivative F'(x) exists , 
except perhaps those of an enumerable set or , more generally , those of 
a set E such that |P[2?]| = 0. Then the function F is of bounded vari- 
ation and , for each interval [a, ft], we have 

b 

P(6)-P(a)< f F\x)dx. 


( 6 . 10 ) 
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Proof. Let P be the set of the points x of [a, ft] at which the 
derivative F'(x) exists and is non-negative. Then, since at each point 
x e P — E we have Q^.F’(x)^.g(x)< + oo, it follows from Theorem 6.5, 

b 

Chap. VII, that |jP[P— 2?]|^ f F'(x) dx < j\g(x)\dx. On the other 

Pa 

hand, bv hypothesis, JP’f | = 0. Hence, on account of Theorem 6.6, 

b 

F(b)-F(a) ^ / \g(x ) ' for each interval [ft, ft], and, in conso- 
rt 

quence, F is a function of bounded variation whose function of 
singularities is monotone non-increasing. The inequality (6.10) fol- 
lows at once. 

In view of Theorem 6.3(2°), we may apply Theorem 6.9, in particular, 
to continuous functions F which are VBG. We also observe that Theorem 6.9, 
when F is of bounded variation, may be deduced from de la Vallee Poussin’s 
Decomposition Theorem (Chap. IV, § 9). 

Theorem 6.9 may be generalized further, by replacing the condition that 
the function g is summable, by the condition that the latter is ‘^-integrable 
(the function F then shows itself to be VBG*). We thus obtain a proposition similar 
to Theorem 7.3, Chap. VI. The proof of Theorem 6.9 in this generalized form is, 
however, more complicated. 

*§ 7. Three theorems of Banach. We have repeatedly 
emphasized the importance of Lusin’s condition (N) in the theory 
of the Denjoy integrals. We shall show in this §, that every con- 
tinuous function which fulfils the condition (N), also fulfils the con- 
dition (T 2 ). This result due to S. Banach [6] (cf. also N. Bary 
[3, p. 195]) renders Theorems 6.6 and 6.9 applicable to functions 
which fulfil the condition (N). 

We shall also study another condition, introduced by 
S. Banach [6] and termed condition (S). We say that a finite function 
F fulfils the condition (S) on an interval if to each number e>0 
there corresponds an tj>0 such that, for each measurable set 
ECI 0 , the inequality \E\<rj implies |P[P]|<e. (This condition is 
essentially more restrictive than the condition (N); cf. the remarks, 
p. 279, also G. Fichtenholz [4].) 

(7.1) Lemma. Given a function F which is continuous on cm in- 
terval I, every closed set ECl contains a measurable set A on which 
the function F assumes each value yeF[E] exactly once. 
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Proof. With each y e F[E] we associate the lower bound x y 
of the set of the points x of E at which F(x)=y, and we denote 
by A the set of all the points x y which correspond in this way to 
the values y eF[.E]. Since the set E is closed, we plainly have ACE 
and F assumes on A each of the values y eF[F] exactly once. 

In order to establish the measurability of A, let us denote, 
for each positive integer n, by E„, the set of the points x f E such 
that E contains at least one point t which is subject to the conditions 
F(t) = F(x) and x—t^l/n. We have A = E — ]^E„, where E is closed 

n 

by hypothesis, and. where each E n is closed by continuity of F. 
The set A is thus measurable and this completes the proof. 

(7.2) Lemma . Lei F be a continuous function which fulfils the con - 
dition (N) on an interval /. Then 

(i) every measurable set EC I contains , for each e>0, a meas- 
urable subset Q , such that | F[E] — jP[Q]|<£, and on which the func- 
tion F assumes each of its values at most once\ 

(ii) every measurable set ECI contains a measurable subset E , 
such that \F[E] — i^[/?]| = 0, and on which the function F assumes 
each of its values at most an enumerable infinity of times. 

Proof, re (i). As a measurable set, E is the sum of a set U of 
measure zero and an ascending sequence of closed sets {E n ). Since 
the function F fulfils the condition (N), we have |jP[Zf]| = 0, and 
hence, the sets F[E„] being measurable, there exists a positive in- 
teger n 0 such that \F[E] — .F[2?nJ|<£. Now, by Lemma 7.1, there 
exists a closed set QCE ^ such that each value y e F[E n ,] is as- 
sumed exactly once by F on Q. This set Q plainly fulfils the condi- 
tions stated. 

re (ii). In view of (i), there exists for each positive integer n 
a measurable set Q n CE , such that \F[E]— F[Q n ]\<lln, and on which 
the function F assumes each of its values at most once. Therefore, 
writing R=£Qn y we see immediately that \F[E]—F[R]' = 0 and 

n 

that on R the function F assumes each of its values at most an 
enumerable infinity of times. This completes the proof. 

We shall establish in this § three theorems due to Banach on 
functions which fulfil the conditions (N) or (S). The first of these 
theorems, which concerns functions fulfilling the condition (N), 
is as follows: 



284 CHAPTER IX. Derivates of functions of one or two real variables. 


(7.3) Theorem . Any continuous function F which fulfils the condition 
(N) on an interval 7, necessarily fulfils also the condition (T 2 ) on 7. 

Proof. Let us denote, for each measurable set ECI , by 91/? 
the class of all measurable sets RCE which are subject to the 
following two conditions: (i) jjPfJS'] — = 0, and (ii) each 

value yeF[E] is assumed by the function F at most enumerably 
often on JR. By Lemma 7.2, the class 91/? is non-empty, however we 
choose the measurable set ECI . We shall denote, for any such set E , 
by the upper bound of the measures of the sets (91/?). 

Consider, in particular, a sequence {H n } of sets (91/) such that 
lim |77„| = /* r Let H=}^H n and let U be a set (91/- //). We verify 

n n 

at once that |f7|=0, whence on account of the condition (N), 
\F[ f7]| = 0. Therefore \F[I — 77] | =\F[ f7]| = 0, so that almost every 
value y e F[I] is assumed by F only on the set 77, and therefore at 
most enumerably often. 

The second of the theorems of Banach concerns functions 
which fulfil the condition (8). 

(7.4) Theorem . In order that a continuous function F be subject 
to the condition (S) on an interval 7, it is necessary and sufficient 
that F be subject on I to both the conditions (N) and (T t ). 

Proof. 1° Suppose that the function F fulfils the 
condition (S) on 7. Since this condition clearly implies the con- 
dition (N), we need only prove that F fulfils the condition (T x ). 

Suppose then, if possible, that the set of the values assumed 
infinitely often on 7 by the function F , is of positive outer measure. 
Since this set is measurable by Theorem 6.4, it contains a closed 
subset Y of positive measure. Let X denote the set of all the points 
x e I such that F(x)e Y. The set X, plainly, is also closed. 

We shall now define by induction a sequence of measurable 
sets LTi) subject to the following conditions: (i) X r Xj = 0 whenever 
i# -jj (ii) \F[X t ]\^\Y\l2 for i= 1 , 2 ,..., and (iii) the function F 
assumes each of its values at most once on each set X h 

For this purpose, suppose defined the first k sets X t for which 

k 

the conditions (i), (ii) and (iii) are satisfied. Let 7?*== 7 —£X h Since, 

k /=i 

on yx h the function F assumes each of its values at most a finite 

/ - i 

number of times, it follows that each value y e Y is necessarily 
assumed on the set E/,. By Lemma 7.2, this set therefore contains 
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measurable subset X*+i such that \F[X k +i]\^\F[E k ]\[2'5z\Y\l2 and 
that each value is assumed by F at most once on X k +\ . The sets 
ATi,X 2 , ... } Xm+\ clearly fulfil the conditions (i), (ii) and (iii). 

The sequence {A"/} being thus defined, it follows from (i) that 

lim|X/| = 0, and hence, remembering that the function l* 7 fulfils 
1 

the condition (S), we have also lim |F[X/]| = 0. But this clearly 

i 

contradicts (ii), since |Y|>0. 

2° Suppose now that the function F fulfils the conditions (N) 
and (T^, but not the condition (S). We could then determine a po- 
sitive number a and a sequence of sets {E k \ in I so that for k— 1, 2, 

(7.5) |£7*|<l/2*, and (7.6) \F[E k ]\>a. 

Let us write E = lim sup E k and A = lim sup F[E k \. We easily see 

k k 

that, if $ e A, then either y e F[E], or else the value y is assumed 
by F on I infinitely often (in fact there exists an increasing sequence 
of positive integers and a sequence of points {#,}, every two 
of which are distinct, such that x i e E^ and F(xi) = y for * = 1 , 2 ,...). 

Now, on account of (7.5), we have |2?| = 0, and therefore 
also | J F[i?]| = 0. On the other hand, by (7.6), |A|>(r>0. Thus 
^ cr, and since, as we have just seen, each value 
y e A ~F[E ] is assumed by F infinitely often on 7, this contradicts 
the hypothesis that the function F fulfils the condition (Tj). 

We shall establish next a “differentiability theorem” for 
the functions which fulfil the condition (N): 

(7.7) Theorem . In order that a continuous function F be absolutely 
continuous on an interval 7 0 , it is necessary and sufficient that the 
function F fulfil simultaneously the condition (N) and the condition 

(7.8) I F' (x) dx < + oo, 

p 

where P denotes the set of the points at which the function F has 
a finite non-negative derivative. 

Proof. Since the conditions of the theorem are obviously 
necessary (cf. Theorem 6.7, Chap. VII), let us suppose that the 
function F fulfils the condition (N) and the inequality (7.8). Let g 
be the function equal to F'(x) for x e P and to 0 elsewhere. Then, 
if E denotes the set of the points x at which F'(;r) = + oo, we shall 
have F’(x) ^g(x) at every point x of I 0 —E at which the derivative 
F'(x) exists. 
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On the other hand, since |2?| = 0 (cf. Theorem 4.4, or 
Chap. VII, §10, p.236), we have |^[i7] | = 0, and, since the func- 
tion F fulfils, by Theorem 7.3, the condition (T 2 ), it follows from 
Theorem 6.9 that F is of bounded variation on I 0 . This completes 
the proof, since, by Theorem 6.7, Chap. VII, every continuous 
function of bounded variation, which fulfils the condition (N) is 
absolutely continuous. 

Theorem 7.7 (in a slightly less general form) was first proved by N. Bary 
[2; 3, p. 199]. It shows in particular that every continuous function F{x), which is 
subject to the condition (N) and whose derivative is non-negative at almost every point 
where F(x) is derivable , is monotone non -decreasing . This proposition contains an 
essential generalization of Theorem 6.2. Chap. VII. 

Theorem 7.7 may, moreover, be generalized still further. If a continuous 
function F(x) fulfils the condition (N) and if the function g(x ), equal to F'(x) wherever 
F(x) is derivable and to 0 elsewhere , has a major function (in the Perron sense), 
then the function F(x) is ACG # i. e. an indefinite F -integral. 

For the part played by the conditions (N), (TJ and (T 2 ) in the theory of 
Den joy integrals, cf. also J. Ridder [8]. 

From Theorem 7.7 we obtain the third theorem of Banach: 

(7.9) Theorem. Any function which is continuous and subject to the 
condition (N) on an interval , is derivable at every point of a set of 
positive measure. 

* § 8. Superpositions of absolutely continuous functions. 

Suppose given a bounded function G on an interval [a, 6], and a func- 
tion H defined on the interval [a,/?] where a and ft denote respectively 
the lower and the upper bound of G on [a, 6]. We call superposition 
of the functions G and H on [a, 6], the function H(G(x)). The func- 
tion G is termed inner function and the function H outer function 
of this superposition. 

If a function F is continuous and increasing on an interval 
[a, 6], the continuous increasing function G defined on the interval 
[F(a).F(b)] so as to satisfy the identity G(F(x)) = x on [a, 61, will, 
as usual, be termed inverse function of F and denoted by F ’. 

It has long been known that the superposition of two abso- 
lutely continuous functions is not, in general, an absolutely con- 
tinuous function. By means of the conditions discussed in the 
preceding §§, particularly the condition (8), Nina Bary and D. Men- 
choff succeeded in characterizing completely the class of functions 
expressible as superpositions of absolutely continuous functions. 
(Cf. also G. Fichtenholz [3].) 
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(8.1) Theorem . Any function F which is continuous and subject 
to the condition (T x ) on an interval [a, ft] is expressible on this interval 
as a superposition of two continuous functions , of which the inner 
function is of bounded variation and the outer function is increasing 
and absolutely continuous . 

//, further , the function F fulfils the condition (N), the inner 
function of this superposition is necessarily absolutely continuous also . 

Proof. Let a and denote respectively the lower and upper 
bounds of F on fa, 6], and let s F (y) denote, for each y, the number 
(finite or infinite) of the points of the interval [a, ft] at which F 
assumes the value y. Since, by hypothesis, the function F is contin- 
uous and subject to the condition (Tj), we shall have 0<1/Sf(3/)^1 
for almost all the values y of the interval [a,/?]. Let us denote by V 
an indefinite integral of the function which is equal to 1 /s f {y) on 
[a,/?] and to 1 elsewhere. We now write G(x) = U[F(x)] for xe[a,b\ 
We thus have F(x)=U 1 [(?(#)], and in order to establish the first 
part of the theorem, it is enough to show r that (i) the function U ~ l 
is absolutely continuous and (ii) the function G is of bounded variation. 

Suppose, if possible, that the function V~ l (which is continuous 
and increasing together with U) is not absolutely continuous. Then 
(cf. Theorem 6.7, Chap. VII), there exists a set E of measure zero 
such that |I/~'[2?]|>0. Writing Q=U~'[E], we thus have 

(8.2) |0| >0 and |[7[0]| = O. 

We may, plainly, suppose that the set E , and therefore the 
set Q, are sets (©,»). Thus (cf. Theorem 13.3, Chap. Ill) 

\U[Q]\= fU’(y)dy , 

Q 

which renders the relations (8.2) contradictory, since almost every- 
where U\y)— l/sp (y) > 0 for ye[a,/J] and U'(y)—l outside the in- 
terval [ a,/?]. 

In order to establish (ii), we shall make use of the criterion 
of Theorem 6.4. Denote for each z by »g(z) the number of the points 
of the interval [a, b] at which the function G assumes the value z. 
Since the function U is increasing, we clearly have Sa(U(y))=Sr(y) 
for each y, and 8q(z) = 0 for each z outside the interval [ If (a), U (/!)]. 
Hence, remembering that the function U is absolutely continuous, 
we obtain (cf. Theorem 15.1, Chap. I) 
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+ po UW) p 

J sa(z)dz= j 8q(z) dz — f s 0 (U(y))dU(y) = 

— oo £7\a) a 

P P 

=j s F (y)U'(y)dy = Jdy = p — a 

a « 

which shows, by Theorem 6.4, that the function G is of bounded 
variation. 

Finally, if the given function F fulfils the condition (N), so 
does the function G(x) — U(F(x)), and the latter, since it is of 
bounded variation, is absolutely continuous by Theorem 6.7, Chap.VII. 
This completes the proof. 

(8.3) Theorem. 1° In order that a continuous function F be expres- 
sible as a superposition of two continuous functions of which the inner 
function is of bounded variation and the outer function is absolutely 
continuous , it is necessary and sufficient that F fulfil the condition (Tj). 

2° In order that a continuous function be representable as a super- 
position of two absolutely continuous functions , it is necessary and 
sufficient that the function fulfil both the conditions (T,) and (N), or 
what amounts to the same, the condition (S). 

Proof. Since it follows at once from Theorem 8.1 that these 
conditions are sufficient, we need only prove them necessary. 

Let therefore F(x) — H(G(x)) on an interval [a, 6], where G 
is a function of bounded variation and H an absolutely continuous 
function. Let a and fi be respectively the lower and the upper bound 
of G on [a, 6], Let Eq and Eh denote the sets of the values which the 
functions G and H assume infinitely often on the intervals [a, 6] 
and [a,/?], respectively. Since the functions G and H fulfil the con- 
dition (TJ, we have \Eq\=\E h \ =0, and since the function H is, 
moreover, absolutely continuous, we have also |H[E G ]| = 0. Now 
we see at once that each value which is assumed infinitely often 
on [a, 6] by the function F, belongs either to E H , or to H[Eo]. The 
set of these values is thus of measure zero, and the function F ful- 
fils the condition (T,). 

If, further, the function G is absolutely continuous (as well 
as H ), then the function F is a superposition of two functions which 
fulfil the condition (N), and, thei fore, itself fulfils this condition. 
This completes the proof. 
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Theorems 8.1 and 8.3 are due to Nina Bary [1; 3, pp. 208, 633] 
(cf. also S. Banach and 8. Saks [1]). Part 2° of Theorem 8.3 was 
established a little earlier in a note of N. Bary and D. Menchoff [1] 
(cf. also N. Bary [3, p. 203]) in a form analogous to Theorem 6.2. Thus: 

(8.4) Theorem . In order that a function F which is continuous on 
an interval [a,fe] he on this interval a superposition of two absolutely 
continuous functions , it is necessary and sufficient that the set of the 
values assumed by the function F at the points of [a<b] at which F is 
not derivable , be of measure zero . 

Proof. Let Q F be the set of the points of [a, b] at which F 
is not derivable. Suppose first that 

1 <> F(x)= H(G(x)) on [a, 6], where H and G are absolutely 
continuous functions. Let Q a and Q H be respectively the sets 
of the points at which the functions G and H are not derivable. 
We have \Q a \ =\Q H \ =0 and, consequently, |.F[# 6 ]| = \H[Q fl ]\ = 0. 
Now, we see at once that if the function F is not derivable at a point x , 
then either xeQ (j , or G(x)eQ H . Therefore F[Q f ]CF[Q g ] + 
and hence, |jFK? F j| = 0. 

Conversely, suppose that 

2° = 0. By Theorem 6.2, the function then fulfils the 

condition (T x ). To show that F also fulfils the condition (N), con- 
sider any set of measure zero, E say, in [a, b]. Since the function F 
is derivable at each point of E—Q F , we have, by Theorem 6.5, 
Chap. VII, \F[E— Qj,]) =0, and since, by hypothesis, |P[<?,0! = 0, 
we obtain |2^[7^]| = 0. The function F thus fulfils both the conditions 
(TJ and (N), and is, therefore, by Theorem 8.3 (2°), a superposition 
of two absolutely continuous functions. 

It follows from Theorem 8.3 (2°) that a superposition of any finite number 
of absolutely continuous functions is expressible as a superposition of two absolutely 
continuous functions . For the superposition of any finite number of functions 
which fulfil the condition (8), itself fulfils this condition. 

The results exposed in this § have been the starting point of the important 
researches of Nina Bary [31 on the representation of continuous functions 
by means of superpositions of absolutely continuous functions. Let us cite two of 
her fundamental theorems: 1° Every continuous function is the sum of three super - 
positions of absolutely continuous functions , and there exist continuous functions 
which cannot be expressed as the sum of two such superpositions. 2° Every continuous 
function which fulfils the condition (N) — or, more generally , eeery continuous function 
which is derivable at every point of a set which has positive measure in each interval — 
is the sum of two superpositions of absolutely continuous functions , and there exist 
continuous functions which fulfil the condition (X), but are not expressible as one 
superposition of absolutely continuous functions (the function U{x) discussed above 
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§ 9. The condition (D). We shall now establish, for the 
extreme approximate derivates, a theorem, analogous to Theo- 
rem 4.6, but whose proof depends on a different idea. It is con- 
venient to formulate it, from the beginning, in a slightly more ge- 
neral manner. 

Given two positive numbers N and e, we shall say that a func- 
tion F fulfils at a point x 0 the condition (Djv, ,), if there exist positive 
numbers h, as small as we please, such that the difference between the 
outer measure of the set E[E(;e) — F(x 0 ) ^ N • ( x — x 0 ); — x 0 ^h] 

X 

and that of the set E [F(x)—F(x 0 )^—N-(x—x 0 )-, 0^.x—x o ^h] 

X 

exceeds the number he in absolute value. By symmetry, merely 
replacing F(x)— F(x 0 ) and x—x 0 by F(x 0 )—F(x) and x 0 —x respec- 
tively, we define the condition (D^ f ). 

If, for a point x Q , there exists a pair of finite positive numbers N 
and e such that the function F fulfils at this point the condition (D# if ), 
or the condition (D^r, f ), we say that F fulfils at x 0 the condition (D). 

For measurable functions the condition (D) may be formulated 
more simply: a measurable function F fulfils at a point x oy the con- 
dition (D), if there exists a finite positive number N such that x 0 
is not a point of dispersion for the set of the points x at which 
|F(#) — F(x q ) ^N-\x — x 0 \. 

(9.1) Theorem . If any one of the four approximate extreme derivates 
of a function F is finite at a point x 0 , then the function fulfils the con- 
dition (D) at this point. 

Proof. Suppose, to fix the ideas, that \Ft ? (x 0 )\<-{-co and write 
^==|F+(.r 0 )|-f 1. Let E x and E 2 be the sets of the points x which are 
situated on the right of the point x 0 and which fulfil respectively the 
inequalities F(x)~~ F(x 0 )^N‘(x~ x 0 ) and F(x)w F(x 0 )^~ -N-(x—x 0 ). 
It follows at once from the definitions of approximate derivates 
(Chap. VII, § 3) that x Q is a point of dispersion for the set E x , while 
E 2 has at x 0 a positive upper outer density. Denoting the latter 
by d, we see at once that the function F fulfils at x Q the condition 
(Dau) whatever be the positive number e<6. 

(9.2) Lemma . Let N and e be finite positive numbers and suppose 
that a finite function F fulfils the condition (Div, f ) at each point of 
a set E. Then \F[E]\^(2 N /e)-\E\. 
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Proof. We shall first show that for every interval / = [«,&], 

(9.3) \l\Mel‘2N).\F[E.I]\. 

For this purpose, we write, for every y, 

(9.4) H(y)=\E[F(x)^y,xel]\. 

X 

The function H thus defined is non-increasing and bounded on the 
whole/ straight line (— oo, + oo); we have, in fact, for every y. 

(9.5) OsSff(?/K|Z|. 

Given an arbitrary point y 0 of F[E-I], which is distinct from 
F(b) and at which the function H is derivable, let us consider a point 
x Q eE-I such that F(x 0 )=y 0 . Plainly j? 0 =t =b. Let us write, for brevity, 

A (k,k)=E[F(x)^y 0 -\- k-(x — j? 0 ); 0<.r — x 0 ^h] 

X 

and 

B(h,k)—Fi[F(x)^y 0 +khy 0^x~x 0 ^h]. 

X 

For every subinterval [x 01 x 0 +h] of i, we then have the re- 
lation B(h,N)C A(h,N)C A(h, — N)CB(h 1 — JV), whence it follows 
easily, on account of (9.4), that 

H(y 0 -Nh)-H(y 0 +Nh)>\B(h,-N)\-\B(h,N)\^\A(h,-N)\-\A(h,N)\. 

Now, since F fulfils, by hypothesis, the condition (Djv,<) at ./' 0 , there 
exist positive values h, as small as we please, such that 

\A(h,-N)\-\A(KX)\>he, 

and therefore H(y 0 — Nh)— H(y 0 + Nh)^the. Hence, H’(y 0 )^—el‘2 X 
for every point y 0 ^F(b) of F[E1] at which the function II is 
derivable. Therefore, denoting, for each positive integer n , by Q„ 
the part of the set F[E-I] contained in the interval [— n, w], we 
find, on account of (9.5), \I\^\H(n) — H( — n)\^e-\Q„\l2N , from which 
the inequality (9.3) follows by making n -> oo . 

This being established, let rj be a positive number and \h} 
a sequence of intervals such that 

(9.6) EC Elk and \E\ + n>E\h\. 

k ft 

Since (9.3) holds for each interval /, it follows from (9.6) that 
|-E|+ 7) > (s/2 N) -Z\ F[E -Ik]\> (el'2N)>\F[E]\ y whence, remembering 

ft 

that t) is an arbitrary positive number, we see that |F[KJK(2N/f)-;/£|. 
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(9.7) Theorem . If at each point of a set E , a finite function F ful- 
fils the condition (D) (and so } in particular , if at each point of E the 
function F has one of its extreme approximate derivates finite ), 
then the function F fulfils the condition (N) on E . 

Proof. Let H be any subset of E of measure zero, and let H n 
denote, for each positive integer n, the set of the points x of H at 
which the function F fulfils the condition (D£i,„) or (D^i,,). We 
clearly have H=£H r ,, and since, by Lemma 9.2, |P[/L/]|<4n 2 |ff/,|==0, 

we obtain |F[i7]|=0. 

Theorem 9.7 enables us to complete Theorems 10.5 and 10.14 
of Chap. VII, as follows: 

(9.8) Theorem . 1° Every finite function F which is continuous on 
a closed set E and which has at each point of E , except perhaps those 
of an enumerable subset , either two finite Dini derivates on the same 
side , or one finite extreme bilateral derivate , is ACG* on E. 

2° Every finite function F which is continuous on a closed set E 
and which has at each point of E , except perhaps those of an enumerable 
subset , either one finite Dini derivate , or one finite extreme approximate 
bilateral derivate , or finally two finite extreme approximate unilateral 
derivates on the same side , is ACG on E. 

Proof. By Theorems 10.1, 10.5, 10.8 and 10.14 of Chap. VII, 
the function F is VBG* on E in case 1° and VBG on E in case 2°. 
On the other hand, by Theorems 4.6 and 9.7, this function fulfils, 
in both cases, the condition (X) on E . Hence, by Theorems 6.8 
and 8.8 of Chap. VII, the function is ACG* on E in case 1°, and 
ACG on E in case 2°. 

In the most important case in which the closed set E is an 
interval, Theorem 9.8 may further be stated in terms of Denjoy 
integrals. For this purpose*, let us begin by noting the following 
proposition (cf. A. S. Besicoviteh [2], and J. C. Burkill and 
17. S. Haslam- Jones [1]): 

(9.9) Theorem . If a finite function F is measurable on a set E and 
has at each point of this set one of its Dini derivates finite , then this 
derivate is , at almost all points of E y an approximate derivative of F . 

Proof. It follows from Theorem 10.8, Chap. VII, that the 
function F is VBG on E, and so, approximately derivable at almost 
nil the points of E, Let us denote by E l the set of the points of E 
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at which one at least of the opposite Dini derivates F + and F~~ is 
finite. Plainly, F~(x)^F'* 9 (x)^:F + (x) at each point x at which 
the approximate derivative F'^x) exists, and therefore by 
Theorem 4.1, F~ (x) = F + (x) = F , aL? (x)^ at almost all points x 
of E x . Similarly, we show that F + (x)=^F’~(x)=^F , fiV (x) at almost all 
the points x of E at which one of the derivates F+ and F~ is finite. 
This completes the proof. 

(9.10) Theorem . 1° If f is a finite function which , at each point of 
an interval I 0 , except those of an enumerable set , is equal to an extreme 
bilateral derivate of a continuous function F , then the function f is in- 
tegrate on I 0 and the function F is an indefinite 9 ^-integral of /. 

2° If f is a finite function which , at each point of an interval Z 0 , 
except those of an enumerable set , is equal either to a Dini derivate , or to 
an extreme approximate bilateral derivate of a continuous function F , 
then the function f is 9-integrable on I 0 and the function F is an in- 
definite 9-integral of /. 

Proof. In view of Theorem 9.8, the function F is AOG* in 
case 1°, and ACG in case 2°. Moreover, at almost all the points x 
of E , we have F‘(x)=f(x) in case 1°, and by Theorem 9.9, F^ v (x)=rf(x) 
in case 2°. This proves the theorem. 

Although Theorem 9.8 presents a formal analogy with Theorems 10.5 and 
10.14 of Chap. VII, there is an essential difference between the result of this § and 
those of §10, Chap. VII. We see, in the first place, that the criteria of Theorems 
10.5 and 10.14 of Chap. VII concern functions which are given on quite arbitrary 
sets, whereas those of Theorem 9.8 are established only for closed sets. In the 
second place, if the derivates of a quite arbitrary function satisfy on a set E the 
conditions of Theorem 10.5, or of Theorem 10.14, of Chap. VII, then the set E can, 
by these theorems, be decomposed into a sequence of sets on which the function 
is absolutely continuous. On the contrary, Theorem 9.8 of this § does not enable 
us to draw any conclusion as to a similar decomposition of the set E (even when 
this set is an interval), unless the function considered is continuous. 

Two examples will now be given to show that this feature of 
Theorem 9.8, fwhich represents a restriction as compared with the results of § 10, 
Chap. VII, is essential for the validity of the theorem. 

(i) Consider the function F{x)— £[2 n x]/5 n t where [2 n x] denotes, as usual, 

n 

the largest integer not exceeding 2 n x. This function is increasing. Its lower right - 
hand derivate is finite everywhere, and even, as we easily see, vanishes identically. 
Nevertheless, there is no decomposition of the interval J 0 =[0, 1] into a sequence 
of sets on which F is absolutely continuous, or even only uniformly continuous. 
In fact, no such decomposition can exist for a monotone function F whose points 
of discontinuity form a set everywhere dense in J 0 . 
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For, if such a decomposition {E x , E 2 * •••* existed, one at least of the 

sets E n would, by Baire’s Theorem (Chap. II, §9), be everywhere dense in an interval 
ICJ 0 . This is plainly impossible since the function F, monotone by hypothesis, 
is uniformly continuous on each set E n and has points of discontinuity in the in- 
terior of 1. 

(ii) Let us now consider an example of a continuous function 
F(x ), increasing on the interval «/ 0 = [0, 1 ], and which has its lower 
right-hand derivate zero at every point of a set E> without being 
ACG on E. 

For this purpose, let us agree to call, for brevity, function attached to 
an interval /—[a, 6], any function H(x)> which is continuous and non -decreasing 
on /, and which fulfils the conditions: 

(a) . H(x) is constant on each of the intervals l k of a sequence of non- 
overlapping sub-intervals of / such that |/|= ; the length of any sub- 

k 

interval of I on which II(x) is constant does not exceed \I\/2; 

(b) H(x) --//(a) • -x — a and H(b) — H(x)^b—x for every xtl. 

Such a function is easily obtained, by slightly modifying the construction 
of the function f(x) t considered in Chap. Ill, § 13, p. 101. 

This being so, we shall define by induction a sequence { F n (x ) } of functions 
attached to the interval J 0 , beginning with an arbitrary function F^x) attached 
to this interval. Given the function F n attached to J 0 , let Oj^, 

be a sequence of the intervals of constancy of F n in the interval J Q . (By an 
interval of constancy of a function in J 0 we mean here any interval I(Z<Jo 
such that the function is constant on I without being constant on any sub- 
interval of J 0 which contains I and is distinct from /.) For each k— 1 , 2 ,..., 
we determine a function H^\x) attached to the interval and we write 

V<*>r//('»<fc<' ,) ) _ H\ n \a\ n) )] for x,J„ 

i k 

hr x*l k {n \ fc=l,2, .... 

the sum 2^'* being extended over all the values i such that b\ n) ^ x. 

/ 

The sequence {F n {x)\ being thus defined, let 
(9-11) *»=2>„(*)/2". 

n 

The function F(x) is clearly continuous, increasing, and singular on <7,,. 

Consider the set (I* 1 *) 0 , and let x 0 be any point of E. Then there 

nk 

exists a sequence 2 , intervals each of which contains x 0 in its 

interior. Plainly, for each positive integer n, Fj(b^) — Fj{x 0 ) — 0 if i^.n, and 
Fj(b^)- Fj(x a ) < —x a if j>n. Hence, by (9.11), F(bty~ F (*„)< (6<"> — x 0 )/2 n 

for each n, and therefore F + (x 0 )=O. 

Nevertheless, the function F is not ACG on E. To see this, suppose, if pos- 
sible, that E is the sum of a sequence of sets E n on each of which the function 
F is AC. Since the set J 0 — E is the sum of a sequence of non-dense closed sets, 
one at least of the sets E n is everywhere dense in a sub-interval I of */ 0 , and, 
since the function F is absolutelv continuous on each E . this function wmiM 
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8 10. A theorem ot Denjoy-Khintchine on approximate 
derivates. The considerations of the preceding § will now be 
completed by a theorem which establishes, for the extreme ap- 
proximate derivates, relations similar to those which hold for 
Dini derivates (cf. § 4). This theorem was proved independently 
by A. Denjoy [6, p. 209] and by A. Khintchine [4; 5, p. 212] 
(cf. also J. C. Burkill and U. S. Haslam- Jones [1;3]). 

(10.1) Theorem. If a finite function F is measurable on a set E 

and if, to each point x of E, there corresponds a measurable set Q{x) 
such that (i) the lower unilateral density of Q(x) at x is positive on at 
least one side of the point x and (ii) or Fq{ X ){x)> — oo, then 

the function F is approximately derivable at almost all the points of E. 

Consequently, if a finite function F is measurable on 
a set E, then at almost every point of E either the function F 
is approximately derivable, or else Ft v (x) = F, n {x) = -j- oo and 
F»p(x) = ^ = Fup(x) = oo. 

Proof. In view of Lusin’s Theorem (Chap. Ill, §7), we may 
suppose that the set E is closed and that the function F is con- 
tinuous on E. To fix the ideas, consider the set A of the points x e E 
such that (i,) the lower right-hand density of Q(x) at x is positive 
and (iii) Fq M (x)<+ °°- We shall show that the function F is ap- 
proximately derivable at almost all the points of A. By symmetry, 
this assertion will remain valid for each of the other three subsets 
of E, defined by a similar specification of the conditions (i) and (ii) 
of the theorem. 

Let us denote by P the set of the points of A at which the 
function F is not approximately derivable, and suppose, if pos- 
sible, that |P|>0. For each positive integer n, let A„ be the set 
of the points x of E such that the inequality 0^ft<l/n implies 

(10.2) |E [F (t)— F (x)^n-(t —x)-, teE\ x^t^.x+h]\^-hln. 

t 

The sets A„ are closed. To see this, let us keep an index « fixed 
for the moment, and let {x/ x ll= i, 2 ,... be a sequence of points of the 
set A„ converging to a point x 0 . Let I n be a non-negative number, 
and, for brevity, let E/=E[F(t)-F(xi)^n-(t --*/); teE; Xi^t^Xt+h] 

t 

where i— 0,1,2,.... We obtain \Ei\^hjn for t= 1,2, ..., and since, 
by continuity of F on E, we have E 0 D lim sup Et, it follows 

(cf. Chap. I, Theorem 9.1) that \E 0 \^h[n, which shows that x 0 eA„, 
i. e. that A„ is a closed set. 
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Let us now denote, for every pair of positive integers n and k, 
by A„, k the set of the points x of A n such that the inequality 
implies 

(10.3) |E[teA„; x— ^n- l )-h. 

t 

We observe easily that the sets A n , and therefore the sets A„,* 
also, cover the set A almost entirely. Hence, there exists a pair of 
positive integers n 0 and fc 0 such that |^.^ 1 * 1 ,-P|>0. Let R denote 
a portion of the set A n „,k„-P such that 

(10.4) |P|>0, (10.5) 6(R)<l/n 0 and (10.6) d(R)<l/k 0 . 

Writing G(x)=F{x)—(n 0 -\-l)-x, we shall show that the func- 
tion G is monotone non-increasing on R. Suppose therefore, if pos- 
sible, that there exist two points a and b in R, where a<b, such that 

(10.7) G(a)<G(b). 

Let J= [<*,&]. Since the set A no is closed and the function G 
continuous on A n<> , the function G attains, at a point c of the set 
A nil - J, the lower bound of its values on this set. In virtue of (10.7) 
we have c<b. Since ceA^, and since, by (10.5), 0<f>— c<l/n 0 , 
we may put n=n 0 , x=e and h=b—o in the relation (10.2). We 
thus obtain 

(10.8) \R[G(t)-G(c)^ — (t-cy, teE-,c^t^b]\^(b-c)[n 0 . 

t 

Again, since beRCA„ a ,k,, and since, by (10.6), 0<5 — o<l/Zc 0 , 
we may put n=n 0 , x=b and h=b—c in (10.3). This gives 

(10.9) |E[feA„ 0 ; c<f<6]|>(l -c). 

t 

Now the sets which occur in the relations (10:8) and (10.9) 
are both measurable; it therefore follows from these relations that 
there exist, in the open interval (c,b), points teA^ for which 
G(t)—G(c)^.—(t—c)<0. This is plainly impossible, since the 
function G attains its minimum on the set An,,-J at the point c. 

The function G is thus monotone on the set R, and since it 
is, moreover, measurable (indeed continuous) on the closed set 
EdR, it follows that G is approximately derivable at almost all 
the points of R. On the other hand, however, since RCP the 
function F is approximately derivable at no point of R, and, in view 
of (10.4), we arrive at a contradiction. This completes the proof. 
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By a slight modification of the proof, we may extend Theorem 10.1, in 
a certain way, to functions which need not be measurable. Let us agree to under- 
stand by approximate derivability of a finite function F at a point x 0 , the existence 
of a set for which x Q is a point of outer density and with respect to which the 
function F is derivable (if the function F is measurable, this notion of approxi- 
mate derivability clearly agrees with the definition of Chap. VII, § 3). When 
approximate derivability is interpreted thus in the statement of Theorem 10.1, 
this theorem remains valid without the hypothesis that the function F be meas- 
urable on the set E (although the hypothesis concerning the measurability of 
the sets Q(x) remains essential). 

From Theorem 10.1, we may deduce the following proposition: If, for 
a finite function F , we can make correspond to each point x of a set E, a measurable 
set Q(x) whose lower right-hand density at x is positive , and with respect to which 
the function has an infinite derivative at x f then the set E is of measure zero. This 
theorem is similar to Theorem 4.4, but only partially generalizes the latter. It is 
not actually possible to replace, in Theorem 4.4, the ordinary, by the approx- 
imate, limit, without also removing the modulus sign in the expression 
| F{x+ Zi)-jF(x)|. This rather unexpected fact was brought to light by V. J arnlk [2], 
who showed that there exist continuous functions F for which the relation 
lim ap \F(x+h) — F(x)\/h=+oo holds at almost all points x. 

Finally, let us note that Theorem 10.1 is frequently stated in the following 

form: 

If a finite function F is measurable on a set E , then at almost every point x 
of E either (i) the function F is approximately derivable , or else (ii) there exists a meas- 
urable set E(x) whose right-hand and left-hand upper densities are both equal to 
1 at x y and with respect to which the two upper unilateral derivates of F at x are +00 
and the two lower derivates — 00 . 

It has been shown by A. Khintchine [4] (cf. also V. Jam Ik [2]) that 
there exist continuous functions for which the case (ii) holds at almost every point x. 

§ 11. Approximate partial derivates of functions of 
two variables. The §§ which follow will be devoted to generali- 
zations of the results of § 4 for functions of two real variables (their 
extension to any number of variables presents, as already said, 
no fresh difficulty). In this § we shall establish some subsidiary 
results. 

Given a plane set Q and a number rj, we shall understand 
by the outer linear measure of Q on the line y ~ rj, the measure of 
the linear set E[(t, rj) eQ]. Similarly, we define the outer linear measure 

i 

of Q on a line x=£, where f is any number. It follows from Fubini’s 
Theorem in the form (8.6), Chap. Ill, that if Q is a measurable set 
whose linear measure on almost all the lines y—rj (i. e. on the lines 
y=t] for almost all values of rj) is zero, then the set Q is of plane 
measure zero. 
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A point (x 01 y 0 ) will be termed point of linear density of a plane 
set Q in the direction of the x-axis , if x 0 is a point of density of the 
linear set E[(J,y 0 ) f $]. We define similarly the points of linear den- 

t 

sity of Q in the direction of the y-axis. 

(11.1) Theorem . Almost all points of any measurable plane set Q 
are points of linear density for it both in the direction of the x-axis 
and in that of the y-axis . 

Proof. We may clearly assume that the set Q is closed. Con- 
sider, to fix the ideas, the set D of the points of Q which are points 
of density of Q in the direction of the ;r-axis. Since the set Q—D 
is of linear measure zero on each line y= rj, the proof of the relation 
— Z>| = 0 reduces to showing that the set D is measurable. 

In order to do this, we write, for each point (x,y) and each 
pair of numbers a and fc, 

E(x,y, a,b) = E[(t,y)eQj a^ts^b], 

t 

and we denote, for each pair of positive integers n and k , by Q n , k the 
set of the points (x,y) of Q such that the inequalities a<x<b and 
b—a^l/k imply \E(x,y; a,6)|>(l — n~ x )(b — a). Plainly D=fJUQn,/f 

n k 

We now remark that all the sets Q n>k are closed. To see this, 
we keep the indices n and k fixed for the moment, and consider 
an arbitrary sequence {(x l9 y.)) i==l 2 of points of Q n , k which converges 
to a point (x 0 ,y 0 ). Let a and b denote real numbers such that a<x 0 <b 
and b— a^.l[k. For every sufficiently large index i, we then have 
a<Xi<b, and so |2?(#.,t/ / ; a,fc)|>(l — n~ l )-(b — a). Now it is easy to 
see that lim sup E(x n t/.; a,b)CE(x 0 ,y 0 ) a, 6); it therefore follows from 

i 

Theorem 9.1, Chap. I, that |22(# 0 ,# 0 ; a,fc)|^(l — n *)•(& —a), and so, 
that 2 /q) € Q n * k ‘ 

Since the sets Q n ,k are closed, D is a set ($<,,*) and this com- 
pletes the proof. 

If F is a finite function of two variables, the extreme approx- 
imate partial derivates of F(x,y) with respect to x will be denoted 
by E a Pjc , F aPjr and F aPx . If these derivates are equal at a point 
(x,y), their common value, i. e. the approximate partial derivative 
of F with respect to x, will be denoted by F' iPx (x,y). Analogous 
symbols will be used with respect to y. For the partial Dini deri- 
vates, we shall retain the notation of Chap. V, namely Ft, Ft, etc. 
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(11.2) Theorem. If a finite function of two variables F is meas- 
urable on a set Q, its extreme approximate partial derivates are them- 
selves measurable on Q. 

Proof. In view of Lusin’s Theorem (Chap. Ill, § 7), we may 
suppose that the set Q is closed and that the function F is con- 
tinuous on Q. Consider, to fix the ideas, the derivate F£ Px . Let a be 
any finite number and let P be the set of the points (x,y) 
of Q at which Ft Px (x,y) < %. We have to prove that the set P is 
measurable. 

For this purpose, let D denote the set of the points of the 
set Q w'hich are its points of linear density in the direction of the 
aj-axis. Further, for every point (x,y) and every positive integer n, 
let E n (x,y) denote the set of the points t such that 

f>x, (t,y)eQ and F(t,y) —F(x,y) < (a + » _1 ) • (t — x). 

We easily observe (cf. Chap. VII, § 3) that, in order that 
F£ Px (x 0 ,y 0 ) ^.a, at a point (x 0 ,y 0 ) e D, it is necessary and suffi- 
cient that the point (x 0 ,y 0 ) be a point of right-hand density for 
every set E n (x 0 ,y 0 ), where n = 1 , 2 ,... Hence, denoting for every 
system of three positive integers n, k and p, by Q„, *, p the set of 
the points (x,y) of Q such that the inequality O^h ^ l/p implies 
\E„ (x, y) ■ [x, x+ A]| ^ ( 1 — k ~ I ) ■ h, we have 

(n.3) PD=nnzQ.,k. P . 

n k p 

Now the set Q is closed and the function F is continuous on Q, 
and by means of Theorem 9.1, Chap. T (cf. the proofs of Theorems 10.1 
and 11.1) we easily prove that all the sets Q„,k,p are closed. Hence, 
by (11.3), the set P-D is measurable, and since, by Theorem 11. 1, 
|Q — Z>| = 0, we see that the set P is measurable also. This completes 
the proof. 


It follows, iu particular, from Theorem 11.2 that the extreme approximate 
derivates of any finite measurable junction of one real variable are themselves mea- 
surable functions. We thus obtain a result analogous to Theorem 4.3, Chap. IV, 
which concerned the measurability of Dini derivates (of. also Theorem 4.1, ('hap. V . 
and the remark p. 171). 
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# 12. Total and approximate differentials. A finite func- 
tion of two real variables F is termed totally differentiable, or simply 
differentiable, at a point (x 0 , y 0 ) if there exist two finite numbers 
A and B such that the ratio 

(12.1) [F{x,y)—F{x 0 ,y 0 )—A-(x—x 0 )—B-{y—y 0 )]![\x—x 0 \ + \y—y 0 \] 

tends to zero as (x,y)~*-(x 0 ,y 0 ). The pair of numbers {A, 15} is then 
termed total differential of the function F at the point (x 0 ,y 0 ) and 
we see at once that A and B are the partial derivatives of F at 
(x n , )/ 0 ) with respect to x and to y respectively. 

If, for a finite function of two variables F and for a point 
(j? 0 » ^o)> there exist two finite numbers A and B such that the ratio 

(12.1) tends approximately to 0 as (x,y)->(x 0 ,y 0 ), the function F 
is termed approximately differentiable at (x 0 ,y 0 ) and the pair of 
numbers {A, 15} is called approximate differential of F at (x 0 ,y 0 ). 
The numbers A and B will be called coefficients of this differential. 

We see at once that no function can have at a given point 
more than one differential, whether total or approximate. 

The existence of a total differential of a function F(x,y) at a point may 
be interpreted as the existence of a plane, tangent at this point to the surface 
z—F(x,y) and non-perpendicular to the scy-plane. In this way the notion of total 
differentiability of functions of two variables corresponds exactly to the similar 
notion of derivability of functions of one variable. Nevertheless, whereas every func- 
tion of bounded variation of one variable is almost everywhere derivable, a 
function of bounded variation (in the Tonelli sense), and even an absolutely 
continuous function, of two variables may be nowhere totally differentiable 
(cf. \V. Stepanoff [3, p. 515]). 

The coefficients of an approximate differential of a function 
a't a point are not, in general, approximate partial derivatives of 
this function. Nevertheless they coincide with the latter almost 
everywhere, as results from the following theorem: 

(12.2) Theorem. In order that a finite function of two variables F, 
which is measurable on a set Q, be approximately differentiable at 
almost all the points of this set, it is necessary and sufficient that F 
be, almost everywhere in Q, approximately derivable with respect to 
each variable. 

When this is the case, the approximate partial derivates K Px (x,y) 
and F* v (x,y) are, at almost all the points (x.y) of Q, the coefficients of 
the approximate differential of F. 
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Proof. .1° Suppose that the function F is approximately 
differentiable at almost all the points of Q. We denote, 
for each positive integer n, by R„ the set of the points (f, rj) of Q 
such that, for every square J containing (£, rj), we have 

(12.3) \V[\F(x,y)-F(t,T,)\^n-d(jyAx,y)eJ]\^3-\J\l4< 

whenever d(J)^2/n. Writing R=^R n , we clearly have \Q —#1 = 0. 

n 

Let us now denote, for a general plane set F and any number rj, 
by E m the linear set of the points f such that (f, rj)eE. Keeping 
fixed, for the moment, a positive integer n 0 and a real number rj 0 , 
we consider any two points £, and f 2 of R^ 1 for .which 0<| 2 — ^!<l/» 0 , 
and we denote by J 0 the square [f 1( f 2 ; rj 0 ,rjQ+ — £j]- We then 
have d(J 0 )^2/n 0 , and so, putting n = n 0 , J=J W v=Vo i n (12.3). 
and choosing f=f, and f = f 2 successively, we see at once that the 
square J 0 contains points (x,y) for which we have at the same time 

|P(fi, f]o) F (x,y)\^.n 0 - 6(J 0 )^.2n 0 -(^ 2 fj) 

and 

|P(f 2 , rj 0 )~F(x,y)\^n 0 -6(J 0 )s^2n 0 -(^—^). 

Hence \F(£ V rj 0 ) — F(£ v j? 0 )| < 4n 0 - 1^ 2 — fj, which shows that, 
for any fixed rj, F(x,rj), as a function of x , is AC on each set R^\ 
and so VBG on the whole set R M (cf. Chap. VII, § 5). Now R is 
(with Q) a plane measurable set, so that the linear set .ft 1 '' 1 is meas- 
urable for almost every rj. Hence (cf. Theorem 4.3, Chap. VII) 
for almost all rj, the function F(x,rj) is approximately derivable 
with respect to x at almost all the points of A 1 '' 1 . Since further, 
by Theorem 11.2, the set of the points of R at which the function F 
is approximately derivable with respect to one variable, is meas- 
urable, it follows at once that the function F is approximately 
derivable with respect to x at almost all the points of ft, and so, 
at the same tiipe, at almost all the points of Q. Similarly, we establish 
the corresponding result concerning approximate derivability of F 
with respect to y. 

2° Suppose that the function F is approximately 
derivable, at almost all the points of Q , with respect 
to x and with respect to y. We shall show that the function F 
then has, at almost all the points of Q, an approximate differential 
with coefficients F' aPx (x,y) and F' nPy (x,y). On account of Theorem 11.2 
and of Lusin’s Theorem (Chap. Ill, § 7), we may suppose that 
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(a) the set Q is bounded and closed, (b) the function F is approximate- 
ly derivable with respect to each variable at all the points of Q , and 
(c) the function F , and both its approximate partial derivatives, 
are continuous on Q . 

This being so, we write, for each point of Q and each 

point (x,y) of the plane, 

(12.4) I)(Z,r)-,x y y)=\F(x,y)—F(£,Ti)—(x—t)-FL Px (£,ri)—(y—T])-Fz Py (£,ri)\, 

A(£> Vi x ) = I * 1 ( x , v)— F (Z, V)~ ( x — f , »?)|, 

0 2 (l, rj; y) = \F(£,y)—F(£,r])—{y — T])-Fa Pu {£,Tj)\. 

net s and r be any positive numbers. We shall begin by de- 
fining a positive number o and a closed subset A of Q such that 
\Q— A\<e and such that, for any point (f, * 7 ), 

(i) Vi k-fl; (*, 1?) >(i- e ).(ft- a ) 

whenever (£,r})eA } a<f<ft dmd ft — a<<r. 

For this purpose, let us denote, for each positive integer n, 
by the set of the points (f, ??) of # such that the inequality in the 
first line of (i) is fulfilled whenever a<£<b and ft— a<l/n. Since 
the set Q is closed and since the function F and its derivatives F' aPx 
and F'apy are continuous on Q, it is easily seen that all the sets A n 
are closed. On the other hand, the sets A n form an ascending se- 
quence and we immediately see that the set Q — lim A n is of 

n 

measure zero on each line y±=rj. Hence, this set being measurable, 
we have |Q — lim — 0. Consequently | Q—A *j<« for a suffi- 

n 

ciently large index n 0 , and writing a=l/n 0 and A = A no we find 
that the inequality | Q — A\<e and the condition (i) are both satisfied. 

In exactly the same way, but replacing the set Q by A and 
interchanging the role of the coordinates x and y , we determine 
now a positive number a^<o and a closed subset B of A such that 
\A—B\<e and that for any point (£, rj) 

^ yKr-ly — 1,1; (f, y)f A] a^y^b]\^(l-e)-(b—a) 

whenever (£,r))eB, a^tj^b and b — a<a v 
Finally, let <r 2 < <r i be a positive number such that 

y 2 )—Kp x (x 1 , yj |<r 

for any pair of points (x 1 ,y l ) and {x 2 , y 2 ) of Q subject to the con- 
ditions \x 2 — a?,|<<r 2 and |y s — yjctr,. 
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This being so, let (f 0 , r/ 0 ) be any point of B. Let J = a 2 ,p 2 ] 

denote any interval such that (l 0 >»?oW and d(J)<a 2 <a 1 <a. We 
write: 

E 2 =E[D 2 (£ 0 , rj 0 ; y)^r-\y — Tj 0 \; (£ 0 ,y)eA] a 2 ^y^p 2 ], 

V 

and. for each y , 

X 

Then any point (x,y) such that yeE 2 and xeE^y) belongs* to the 
set Q J and, for such a point, we have 

D((otfoiXi ,v)^ fol * ap^fo?#) F ap x ( fo>i/o)l ^ 

<2t-[|*— fol+ly — ijJ]. 

On the other hand, it follows at once from (ii) and (i) respectively, 
that \E 2 \^(1 — e) (/5 2 — a 2 ), and \E x (y)\^(l— whenever 

yeE 2 . Hence, D(i 01 1 ) 0 ; x,y) being a measurable (indeed contin- 
uous) function of the point (x 7 y) on Q-J, it follows that the set 
of the points (x,y)eQ-J such that D($ 0 ,rj 0 ) a?,yK2r-[|j? — f 0 |+Lv ~ r h\] 
is of measure at least equal to (1 — e) 2 (fa — oq) (fi 2 — a 2 )= (1 — f) 2 -|J|. 
The point (£ 0 , ?j 0 ) here denotes any point of the set JB, and J any 
interval, containing (£ 0 , // 0 ), whose diameter is sufficiently small. 
Therefore, since |Q— Z?|<|(^--^t|+ \A — B|<2r, where e is at our 
disposal, we see that, for every positive number r, almost every 
point (f, rj) of Q is a point of density for the set of the points (x,y) 
of Q which fulfil the inequality />(£,//; x,y)l[\x— f|+ | y — ry|]<2r; and 
in view of (12.4), this completes the proof. 

We notice a similarity between the preceding proof and that of the “Den- 
sity Theorem” (Chap. IV, § 10). Actually the result just established constitutes 
a direct generalization of the Density Theorem. To see this, we need only inter- 
pret, in the statement of Theorem 12.2, the function F as the characteristic 
function of the set Q (cf. the first edition of this book, p. 231). 

The notion of approximate differential, together with Theorem 12.2, are 
due to W. Stcpanoff (3]. There is, however, a slight difference between the defi- 
nition adopted here and that of Stepanoff, so that, in its original form, as proved 
by Stepanoff, Theorem 12.2 generalizes Theorem 6.1, Chap. IV, rather than the 
Density Theorem of § 10, Chap. IV. 

We conclude this § by mentioning the following theorem, 
which, in view of Theorems 9.0 and 11.2, is an immediate con- 
sequence of Theorem 12.2: 
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(12.5) Theorem. Suppose that a finite function of two variables F 
which is measurable on a set Q, has at each point of Q at least one finite 
Dini derivate ■ with respect to x and at least one finite Dini derivate 
with respect to y. 

Then the function F is approximately differentiable at almost 
every point of Q. 

§ 13. Fundamental theorems on the contingent of 
a set In space. Following F. Roger [2], we shall now extend 
to sets in the space i? 3 , the results obtained in § 3. The proofs will 
be largely a repetition of those of §3 with the obvious verbal changes. 
We shall therefore present them in a slightly more condensed form. 

Generalizing the definitions of §3, p.264, to functions of two 
variables, we shall say that a function F(x,y) fulfils the Lipschitz 
condition on a plane set E, if there exists a finite constant N such 
that | F(x 2 , y 2 ) — F(x v y,)| < N - |> 2 — x x \ + | y 2 — y 1 |] for every two 
points (x^y^ and (x 2 ,y 2 ) of E. We verify at once that the 
graph of the function F on E is then of finite area whenever 
\E\ < +oo, and of area zero whenever, in particular, |22|=0 
(cf. Chap. II, §8; more precisely, we have, for every set E, 

In the sequel we shall make use of the following notation 
for limits relative to a set. If E is a set (in any space) and t 0 is a point 
of accumulation for E, the lower and upper limits of a function 
F(t) as t tends to t 0 on E will be written ]imint E F(t) and limsupgF(t) 

respectively. Their common value, when they are equal, will be 
written limi?F(t). 

(13.1) Lemma. Let R be a set in the space Jt 3 , d a fixed direction 
in this space and P the set of the points a of R at which contg f a con- 
tains no half-line of direction 6. Then (i) the set P is the sum of a 
sequence of sets of finite area and (ii) at each point a of P, except 
at most at those of a subset of area zero, the set R has an extreme tan- 
gent plane, for which the side containing the half -line aO is its empty side. 

In the particular case in which 0 is the direction of the positive 
semi-axis of z, the set P is expressible as the sum of an enumerable 
infinity of sets each of which is the graph of a function on a plane 
set on which the function fulfils the Lipschitz condition. 
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Proof. We may clearly suppose (in the first part of the theo- 
rem also) that 0 is the direction of the positive semi-axis of z. We 
denote, for every positive integer «, by P„ the set of the points 
(x,y,z) of P such that the inequalities \x'~ ar|^l/», \y'—y\^.l/n 
and | z' — «|<l/w imply z' — — %\-t\y' — y\] for every 

point (x',y',z') of H. We express, further, each P n as the sum of 
a sequence 1,2,... of sets with diameters less than l/». For even- 

pair of points (x v y x ,z x ) and {x 2 ,y 2 ,z 2 ) of the same set P„,*, we thus 
have |* 4 — ®,|-f |j/ 2 — yj], and if we denote by Q„. k the 
orthogonal projection of P„,* on the .ry-plane, we easily see that 
the set P„,* may be regarded as the graph of a function F„ k on Q„, k . 
Plainly |P n ,*(x 2 ,y 2 )— P„,*(.ri,;y 1 )|<w [|j’ 2 — o-J-f |y 2 — y t |] for every two 
points (x 1 ,y l ) and (x 2 ,y 2 ) of Q„, k . Thus F„, k fulfils the Lipschitz con- 
dition on Q„, k and hence (cf.p.304) A s ( P„, *)=d 2 (B(P, „.*;<?„;*)) <+°o. 
Thus P=2 1 P«.* is the required expression of the set P. 

n,k 

It remains to discuss the existence of an extreme tangent 
plane to R at the points of P. For a fixed pair of positive integers » 
and k, the function F„, k , which fulfils the Lipschitz condition 
on the set Q „, k , can be continued at once, by continuity, on to the 
closure Q„, k of this set, and then on to the whole plane by writing 
F„, k (x,y) = 0 outside Q„ lk . On account of Theorem 12.2, the function 
Pn,* is approximately differentiable at almost all the points of Q„, k . 
Hence, denoting by Q„ -k the subset of Q„, k consisting of the point j* 
of density of Q n , k at which F„, k is approximately differentiable, we 
see that and hence, that -0. 

We need, therefore, only show that R has an extreme tangent plane 
at each point of B (F„.*> <?«.*), and that, further, the half-line with 
the direction of the positive semi-axis of z is contained in the 
empty side of this plane. 

Let (f,„ *? 0 , C 0 ) he any point of B(P„.*;$„,*) and let {A 0 ,Ii 0 ) 
be the approximate differential of F„, k at the point (£„,'/«)• Let 
e<l be any positive number, and let E, be the set of the points 
(x,y)t such that 

| F n , k { x , y ) iV*(f 0 , -i 0 * " 9o)K *■[!•» -ui t | // 

Since the function F„, * is measurable, (£„,//„) is (cf. ( ’hap. VII. §3) 
a point of outer density for the set E, . Hence we can make cor- 
respond to each point (£, ^.C), sufficiently near to (; ft . //„. ), a point 

(S',ri')eE, such that: 
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(13.2) |f'— f 0 |<|f— f 0 | an d W— *7oKI* 7 — %l» 

(13.3) |f'— f|<e-|£— fol and W— *?|<e-|jy— t) 0 \. 

Remembering that C 0 —Fn, k (£ 0 > Vo)> we now write for brevity 

D„M\v’)=FnA f',»7')-Co-^o-(f'-fo)-fio (>?'-»?o)- We thus have 

Qg £ fo fo) Bo'i 1 ! Vo)— 

= D.MW)+lC-**.MWY\ + U»(F-e) + B 9 -{ii'-y)]. 
This being so, let (f, r), f) be a point of R such that each of 
the differences |f— f 0 |, \y— y 0 \ and |C— C 0 | is less than, or equal 

to, l/4» 2 . Then by (13.3), we have |f'— f|<l/« and | y' — y\^.lln, 
while, by (13.2), \F„,k(Z',y')—t 0 \^n-[\£'—£ 0 \+\y'—y 0 \]^.ll2n, and so 
|-F n .A(f ',»?')— f|<l/». Since the point (Z’,y',F„, k ( f',j?')) belongs to 
B (F-,E f )CP„, it follows that f— F n ,*(f', >?')<» -[|f— f'|+|»?— »?'|L 
and, again making use of (13.3), we deduce from (13.4) that 

C fo fo) B 0 -(y y 0 )$% 

(13.5) <|Z> n ,*(f', J? ')| + (n+|A 0 |).|r-f| + (»+|B 0 |)-h'->?| < 
^|^n,*(f',»?')| + e-(|jl 0 |+|R 0 | + w) - [|f — fol+l 7 ? — »?ol]- 

We now observe that, since (£’,r]')e E e , (13.2) implies 

|-Dn.*(f',»?')|/[|f— fol+k— »? 0 |] < |f>i.,*(f ', »?')|/[|f '— fol + l»?'— 1 »7oll < £ - 
Hence, e being an arbitrary positive number, we derive from (13.5) 

(13.6) limsup„ [C-^ 0 -^ 0 -(f-fo)-R 0 -(»?-»?„)]/[|f-f 0 |+|M 0 l<0. 

(f . i?. 0-MI.. >7... C„) 

Moreover, since {.4 0 , R 0 ) is the approximate differential of the func- 
tion F„, k at (f 0 , y 0 ) and since the point (f 0 , y 0 ) is a point of outer 

density for the set Q„, k , the plane z—C o —A o -(x—^ o )—B o -(y—y o ) = 0 
is certainly an intermediate tangent plane (cf. § 2, p. 263) of R at 
the point (f u , »?oifo)- ft * 8 therefore, by (13.6), an extreme tangent 
plane at this point, with an empty side consisting of the half-space 
z— Z^A 0 -(x— f 0 )-(- R 0 (y— y 0 ). This completes the proof. 

We shall employ in space a terminology similar to that of the 
plane (cf. jj 3, p. 264) and agree to say that the contingent of a set 
E C at a point a of E is the whole space if it includes all the 
half-lines issuing from the point a; and again, that the contingent 
of E at a point a of E is a half-space , if E has an extreme tangent 
plane at a and if contg^a consists of all the half-lines issuing from a 
which are situated on one side of this plane. We make use of 
these terms to state the analogue of Theorem 3.6: 
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(13.7) Theorem,. Given a set R in Jt 3 , let P be a subset of R at 
no point of which the contingent of R is the whole space. Then (i) the 
set P is the sum of an enumerable infinity of sets of finite area and 
(ii) at every point of P, except at those of a set of area zero, either the 
set R has a unique tangent plane, or else the contingent of R is a half -space. 

The proof of this statement, which follows directly from 
Lemma 13.1, is quite similar to that of Theorem 3.6. We need only 
replace, in the proof of the latter, the terms length, tangent and 
half-plane by area, tangent plane and half- space, respectively. 

It only remains to extend to space, Theorem 3.7. This ex- 
tension, in the form (13.11) in which we shall establish it, is essen- 
tially little more than an immediate, and almost trivial, conse- 
quence of Theorem 3.7. Its proof requires however some subsidiary 
considerations of the measurability of certain sets. 

(13.3) Lommo,. If Q i,s a set its orthogonal projection 

on the xy-plane is a measurable set. 

Proof. Let us 'denote generally, for every set E situated in 1? 3 , 
by I'(E) its projection on the xy-plane. In order to establish the 
measurability of the set P(Q), it will suffice to show that for each 
£>0 there exists a closed set PC F(Q) such that \P\^-\r(Q)\ — e. 

We express Q as the product of a sequence (Q n }n-i, 2 , of 
sets CJJ. It may clearly be assumed that the set Q is bounded and 
that, moreover, all the sets Q„ are situated in a fixed closed sphere iS 0 . 

We shall define in B 3 , by induction, a sequence i,..„ 

of closed sets subject to the following conditions for n— 1 , 2 ,...: 
(i) F n CF„-u (ii) F n CQ n and (iii) \r(F n -Q)\^\F(F n v Q)\-el2 n . 

For this purpose, we choose F 0 — <S' 0 and we suppose that the 
next r — 1 sets F n have been defined. We have QCQ n and so 
F r - 1 Q, Q=Fr-vQ, and since F r v Q r i%, with Q r , a set (ft„), there 
exists a dosed set F,CF r .. v Q r such that. \l\F,-Q)\^\F{F r r Q)\—sl2 r . 
This closed set F r clearly fulfils (i), (ii) and (iii) for n = r. 

Now let F=IIF n — \im.F n . It follows from (ii) that FCQ, and 

therefore that F(F)cr(Q). Further, F(F) is a closed set, for, since {F n \ 
is a descending sequence of closed and bounded sets, we easily see 
that P(F)=]imr(F n ). Finally this last relation coupled with (iii) shows 

that |r(f , )r>lim|r(J , J ,-Q)|>|r(F 0 -Q)|-f«!/ , W)l-«. whi(h «* om - 

n 

pletes the proof. 
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It would be easy to prove that the projection of a set (5 a< j) is the nucleus 
of a determining system formed of closed sets and thus to deduce Lemma 13.8 
from Theorem 5.5, Chap. II. We have preferred, however, to give a direct ele- 
mentary proof, based on a method due to N. Lusin [3]. The same argument 
shows that any continuous image of a set (5^) is measurable. 

It has been proved more generally (vide, for instance, W. Sierpinski 
[II, p. 149], or F. Hausdorff [II, p. 212]) that any continuous image of an 
analytic set -(in particular, of a set measurable (*©)) situated in /f 8 is an analytic 
and, therefore, measurable set. 

(13.9) Lemma. Given a set R in i? 3 , let Q be the set of the points 
(f, ? 7 , f) of R which fulfil the condition : 

(A) the part of the contingent of R at the point which 

is situated in the plane is wholly contained in one or other of 

the two half -spaces y^rj and y^rj. 

Then the orthogonal projection of the set Q on the xy-plane is 
of plane measure zero. 

Proof. We may clearly suppose that the set R is closed (for 
the contingent of any set R coincides, at all points of R , with that 
of the closure of R). 

Let us denote generally, for any set E in R z and any number f, 

by E [ ' ] the set E [(f,y,2) e E]. It follows from Theorem 3.6 that, 
(.v.o 

for every f, the plane set R { ^ has an extreme tangent, parallel to 
the z-axis at every point of Q ,?l except those of a set of length zero. 
Hence, by Theorem 3.7, the projection of Q on the xy- plane is of 
linear measure zero on each line x— f of this plane, and, in order 
to prove that this projection is of plane measure zero, we need 
only show that the latter is measurable. 

Let us denote, for each pair of positive integers k and n, by 
A k>r i the set of the points (f, rj, C) of R such that the inequalities 

(13.10) \x— fj -f \y — 1 ?| + \z— f|<l/n and \x— f|<[|y— »?| + \z— C\]/n 

imply, for any point (x,y,z)eR, the inequality y— f| + |z— Cl]/*. 
Similarly, we shall denote by B k n the set of the points (f, rj, £) 
of R for which the inequalities (13.10) imply, for every point 
(x, y, z) of R, the inequality y— ^ - [ja?— f| + |z— C|]/fc. Writing 

A =HE^k, n and B=f]UBk,n, 

k n k n 

we find that Q — A-\-B. On the other hand, since tne set R is, by 
hypothesis, closed, we observe at once that each set A kt „, and like- 
wise each set B k , n , is closed. The sets A and B , and so the set Q 
also, are thus sets (5^)> an( i i n view of Lemma 13.8, the projection 
of Q on the xy- plane is a measurable set. 
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(13.11) Theorem. Given a set R in let P be a subset of R at every 
point of which the set R has an extreme tangent plane parallel to a fixed 
straight line D. Then the orthogonal projection of P on the plane per- 
pendicular to D is of plane measure zero. 


Proof. We may clearly suppose that the straight line 1) is 
the 2 -axis. Let us denote by P x the set of the points of P at which 
the extreme tangent plane, parallel, by hypothesis, to the 2 -axis 
is not, however, parallel to the yz- plane. Similarly, P a will denote 
the set of the points of P at which the extreme tangent plane is 
not parallel to the xz-plane. We then have P=Pj + P 2 . 

Now we observe at once that each point (£, >?,£) of P l fulfils the 
condition (A) of Lemma 13.9. It therefore follows from this lemma, 
that the projection of P 1 on the aa/ -plane is of plane measure zero. 

By symmetry, the same. is true of the projection of the set P a . 
The proof is thus complete. 


# 14. Extreme differentials. Let F be a finite function 
of two real variables. A pair of finite numbers {A,B} will be called 
upper differential of F at a point (x 0 ,y 0 ) if, when we write z 0 =F(x 0 , y 0 ), 
(i) the plane z—z n =A-(x—x 0 )-\-B (y—y 0 ) is an intermediate tan- 
gent plane of the graph of the function F at the point (x 0 ,y 0 ,z 0 ) and 


(ii> 


Iim fl „T, y 0 )— A-(x— a?o)— Bjiy— y 0 ) 

I* - + \v — Vo\ 


= 0 . 


These conditions may clearly be replaced by the following: (i x ) the plane 
z — z 0 =A-{x — x 0 ) + B {y — y 0 ) is an extreme tangent plane of the graph 
of F at (x 0 ,y 0 ,z 0 ) with the empty side z— z u ^A-(x— x 0 ) + B-(y — y 0 ), 
and (ii x ) lim sup F(x,y)^F(x 0 ,y 0 ). 

(x,y)-+(x 0 ,y Q ) 

The definition of lower differential is similar, and the two 
differentials, upper and lower, will be called extreme differentials. 

If a function F has a total differential (cf . § 12, p. 300) at a point, 
this differential is both an upper and a lower differential of F at 
the point considered. Conversely, if a function F has at a point 
(x 0J y 0 ) hoth an upper and a lower differential, these are identical 
and then reduce to a total differential of F at (x 0J y Q ). 


For a finite function of one real variable F , the existence of an upper dif- 
ferential at a point x 0 is to be interpreted to mean that F + (x 0 )—F~~(x 0 ) 4 oo (in 
which case the number F*(x Q )=F~(x Q ) may be regarded as the upper differential 
of F at x 0 ). There is a similar interpretation for the lower differential of functions 
of one variable. This interpretation brings to light the relationship between the 
theorems of this § and those of § 4. 
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We propose to give an account of researches concerning the existence 
almost everywhere of total, approximate, or extreme differentials. These re- 
searches were begun by II. Rademacher [3], who established the first general 
sufficient condition in order that a continuous function be almost every- 
where differentiable. W. Stepanoff f 1 ; 3] later removed from Rademacher's 
reasoning certain superfluous hypotheses, and obtained a more complete result, 
valid for any measurable function: In order that a function F which is meas- 
urable on a set A 1 , should be differentiable almost everywhere in E , it is necessary and 
sufficient that the relation lim sup \F(r,y) — F(£ t r])\/[\x — £|-f|y — *;|]<-j-oo should 

(x ,t/hH£,n) 

hold at almost all the points (£,*7) of E. (Pertain details of Stepanoff's proof, par- 
ticularly those concerning measurability of the Dini partial derivates, have been 
subjected to criticism (of. J. C. Burkill and U. S. Haslam -Jones [1].) U. S. 
II aslam -Jones [1] extended further the result of Stepanoff, and by introducing 
the notion of extreme differentials (which he called upper and lower derivate 
planes), obtained theorems analogous to those of Denjoy for functions of one 
variable. The researches of Haslam- Jones h$ve been continued and completed 
by A. J. Ward [ 1 ; 4] who, in particular, removed the hypothesis of measur- 
ability in certain of Haslam -Jones's theorems. 

We shall derive the results of Haslam -Jones from the theorems of the 
preceding § (cf. F. Roger [3]; direct proofs will be found in the memoirs of 
Haslam -Jones and Ward referred to, and in the first edition of this book). 

In what follows, we shall make use of some subsidiary con- 
ventions of notation. If F is a function of two real variables and t 
denotes a point (x,y) of the plane H 2 , we shall frequently write 
F(t) for F(x,y). If t l = (x x ^y l ) and t 2 ~(x 2 ,y 2 ) are two points of the 
plane, 1 1 2 — will denote the number \x 2 — x x \ + | y 2 — Jfil- 

Given in the plane two distinct half -lines issuing from a point t 0 , 
each of the two closed regions into which these half-lines divide 
the plane will be called angle. The point t 0 will be termed vertex 
of each of these angles. 

We shall begin by proving a theorem somewhat analogoUvS 
to Theorem 1.1 (ii). 

( 14 . 1 ) Theorem . Let F be a finite function in the plane H 2 and 
let E be a plane set , each point r of which is the vertex of an angle A(r) 
such that limsup / i(,)F(f)<Um sup F(t). Then the set E is of plane 

t->r /-> r 

measure zero. 

Proof. Let us denote, for each pair of integers p and q , by E p , v 
the set of the points r of E at which Umsup^ (r )F(£)<p/g<liin supF($). 

t-+r t->T 

For fixed p and q , we observe that no point reE p , q is a point of 
accumulation for the part of the set E p , q contained in the interior of 
the corresponding angle A(x). Hence, no point of the set E p%q can 
be a point of outer density for this set. Each of the sets E p , q is 
thus of plane measure zero, and the same is therefore true of the 
whole set E. 
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As we easily see, in virtue of Theorem 3.6, each set E p%qt and consequently 
the whole set E , is the sum of a sequence of sets of finite length (this of 
course, implies that E is of plane measure zero). Cf. A. Kolmogoroff and 
J. Verfcenko [I]. 

(14.2) Theorem . Let F be a finite function in the plane . Then 

(i) if P is a plane set each point x of which is the vertex of an 
angle A( x) such that 

(14.3) lim^ (r) \F(t ) — F(t)\ /\t — r|=+oo, 
the set P is necessarily of plane measure zero\ 


(ii) if Q is a plane set each point x of which is the vertex of an 
angle A 0 ( x) such that 

(14.4) lim sup^ot^)— F(z)]l\t— r|<+oo, 

t~>T 

the function F necessarily has an upper differential at almost all the 
points of Q ; 

(iii) if R is a plane set each point x of which is the vertex of two 
angles A x (x) and A 2 (x) such that 


and 


lira sup^,) [F(«)— F(T)]l\t— t|<+ oo 

t~>Z 

lim inf^. (t ) [F(t)— F(z)~\l\t— r|>— oo, 

t+T 


the function F is totally differentiable at almost all the points of R. 


Proof, re (i). By Theorem 13.7 the set B (P;P) has, at each 
of its points except those of a subset of area zero, an extreme* tan- 
gent plane. The latter is seen to be necessarily parallel to the c-axis. 
Hence, by Theorem 13.11, the set P, as the projection of B(P;P) 
on the ^y-plane, is of plane measure zero. 

re (ii). It clearly follows from (14.4) that, at each point r of Q , 
we have hmsup^P^)^^*)- Hence, by Theorem 14.1, we have 

lim sup F(t)^F(r) at all the points z of Q, except at most those 
of a set Q 0 of measure zero. 

Let us now denote by B the graph of the function F (on the 
whole plane). Let B x be the set of the points of B(F;Q) at which 
the set B has no extreme tangent plane, and B 2 the set of the points 
of B(F\Q) at which such a tangent plane exists, but is parallel to 
the 2 -axis. Finally, let Q x and Q 2 l>e the projections of the sets B } 
and B 2 respectively, on the xy-plane. On account of Theorem 13.7, 
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we easily verify that A. i (B 1 )= 0, and so, that 1^1 = 0. Similarly, 
it follows at once from Theorem 13.11 that |@ 2 | = 0. Now, if (f, rf) 
is any point of Q — (Q 1 + Q 2 ), the set B has at (£,r],F(£,r])) an ex- 
treme tangent plane of the form z—t=M(lj, rj)-(x—l;)+ N(£, rj)-(y—r]), 
where M(£,r]) and N (£,rj) are finite numbers. We observe further 
without difficulty that the half-space 

s— T])-(x— £) + N(£,ri)-(y—r)) 

is an empty side of this plane. Hence (cf. p. 309), at each point 
(f, y) of the set Q— (Q 0 + Qi + Q 2 ), the pair of numbers (iH(£, rj),N(£, rj)) 
is an upper differential of the function F. This completes the 
proof, since K> 0 +#i + # 2 l = °- 

Finally, (iii) is an immediate consequence of (ii). 

In the ease in which the function F is measurable, we can complete 
part (i) of Theorem 14.2 (which itself generalizes Theorem 4.4). Thus, if F 
is any measurable function of two variables , the set of the points (x,y) at which 

lim \F(x+h,y) — F(x,y)\ fh— -f oo, is of plane measure zero. 
h -+ 04 

This proposition plainly follows from Theorem 4.4, except for meas- 
urability considerations, essential to the proof, which seem to require general 
theorems on the measurability of the projections of sets (*©) (cf. p. 308). 

We conclude with the following theorem (cl. A. J. Ward [1] 
and the first edition of tills book, p. 234) which, in view of Theo- 
rem 14.2 (i), (ii), may be regarded as an extension of Theorem 9.9 
to the functions of two variables: 

(14.5) Theorem. If F is a finite function of two variables, which 
is measurable on a set E and which has an extreme differential at each 
point of a set QC E, then this differential is, at the same time, an 
approximate differential of F at almost all the points of Q. 

Proof. On account of Lusin’s theorem (Chap. Ill, § 7) we 
may clearly suppose that the set E is closed and that the function 
F is continuous on E. Let us suppose further, for definiteness, that 
the function F has an upper differential at each point of Q, and 
let us denote, for each positive integer n, by Q n the set of the points 
t of Q such that, for every point t', |C — 1| <l/n implies the 
inequality F(t')~- F(t)<n-\t’~t\. Finally, let each set Q n be ex- 
pressed as the sum of a sequence {()„,*}*_- 1 . 2 , .. of sots with diameters 
less than 1/a. We shall have Q-XQn,k- 

n,k 
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We see at once that the function F fulfils the Lipschitz con- 
dition on each set and therefore also on each set Q„ ik . Hence, 
by Theorem 12.2, the function F has the approximate differential 
{ FL Px (x,y ), Fap u (x,y)\ at almost every point (x,y) of each set Q„, k , 
and therefore at almost every point ( x,y ) of the set Q. 

Let us, on the other hand, denote, for each point (x,y) of Q, 
by [A(x,y), B(x,y)} the upper differential of F at this point. It 
follows at once from the definition of upper differential, p. 309, that 
E7(x,y)>A(x,y)^Ft (x,y), and similarly F^(x,y)^B(x,y)^Fy(x,y), 
at each point (x,y) of Q. Hence, at each point (x,y) of Q at which 
the approximate partial derivates F* Px (x,y) and exist, 

we have A(x,y)=F* Px (x,y) and B(x,y)—F^ y (x,y). The upper 
differential {A(x,y), B (x, y)} of the function F thus coincides at 
almost all points (x,y) of Q with the approximate differential of F. 



NOTE I. 


On Haar’s measure 

by 

Stefan Banach. 

§ 1 . This Note is devoted to the theory of measure due to Al- 
fred Haar [1]. Haar’s beautiful and important theory deals with 
measure in those locally compact separable spaces for which the 
notion of congruent sets is defined. His measure fulfils the usual 
conditions of ordinary Lebesgue measure: congruent sets are of 
equal measure and all Borel sets (more generally, all analytic sets) 
are measurable. The theory has important applications in that 
of continuous groups. 

To complete the definitions of Chap. II, § 2, we shall say that 
a set situated in a metrical space is compact, if every infinite subset 
of the set in question has at least one point of accumulation. A met- 
rical space is termed locally compact if each point of this space 
has a neighbourhood which is compact. 

$2. In what follows we shall denote by E a fixed metrical 
space, separable and locally compact, and we shall suppose that, 
for the sets situated in E , the notion of congruence is defined 
so as to fulfil the following conditions: 

i v A = B implies BzzA, A^B and B~C imply A^C; 

1 2 . If A is a compact open set and A?=B, then the set B is itself 
open and compact ; 

1 3 . If A^B and {A n } is a ( finite or infinite) sequence of open 
compact sets such that AC%A nj then there exists a sequence of sets 

n 

{£„} such that BC2 jB„ and such that A n ^B n for n — 1 , 2 ,...; 

n 

1 4 . Whatever be the compact open set A, the class of the sets con- 
gruent to A covers the whole space 22; 

1 5 . If {$„} is a sequence of compact concentric spheres with radii 
tending to 0, and {G„} is a sequence of sets such that G n s*S„, then the 
relations a=lima„ and b—]imb„, where a„eG n and b„eG„, imply a—b. 

n n 
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§& Given two compact open sets A and B , the class of the 
sets congruent to A covers, by i 4 , the set B. It therefore follows 
from the theorem of Borel-Lebesgue that there exists a finite system 
of sets congruent to A which covers B. Let h(Z?,A) denote the least 
number of sets which constitute such a system. 

It is easy to show by means of q— i 5 that, for any three com- 
pact open sets A, B and C, the following propositions are valid: 

ii x . C(ZB implies h(C, A)^.h(B, A)) 
h(B + C,A)<b(£,A) + h(C,A); 

11 3 . B = C implies h(B J A) = h(C,A)-, 

11 4 . h(B,iKh(B,C)-h(C,4); 

11 5 . If g(A,B)>0 and if { S n } is a sequence of compact con- 
centric spheres with radii tending to 0, then there exists a positive 
integer N such that , for every n>N , 

(3.1 ) h (A + B, S n ) = h {A, S n ) + h (B, 8 tt ). 


All these propositions are obvious, except perhaps ii 5 . To 
prove the latter, let us suppose, if possible, that there exists an 
increasing sequence of positive integers {w,} such that (3.1) does 
not hold for any of the values n=ni. There would then exist a se- 
quence of sets {Oi} such that G^ ^ S n _ while A- 6/ =4=0 and B*Gr- 4=0. 
Consider now arbitrary points a, e A Gt and bi e £•(?,. Since the 
sets A and B are compact, the sequences {a/} and {ft,} contain re- 
spectively convergent subsequences and {6^}. Let a==lima^ 

j J j J 

and 6 = lim£q. By i 5 we must have a=b , and this is impossible 
j J 

since, by hypothesis, ^(A,B) = 0. 

We shall now suppose given a fixed compact open set G and 
a sequence {$„} of concentric spheres, with radii tending to 0, which 
are situated in G and therefore clearly compact. For every compact 
open set A , we write 


l»(A) = 


h (A,S n ) 
h((?, S n ) 


We then have, by ii 4 , 


hU,S fl )<h(i,6)-h(6,4) and h«?,S„Kh(<?, A)-h(A, S n ), 

and hence for each w = 1,2,..., l/h(0, A)^l„(A)^h(A f G). 

Thus, (i/i(A)} is a bounded sequence whose terms exceed a fixed 
positive number. 
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»4. We now make use of the following theorem (cf. 8. Ba- 
nach fl, p. 34] and S. Mazur [1]), in which {£„} and {t)„} denote 
arbitrary bounded sequences of real numbers, a and b denote real 
numbers, and the symbols lim, limsup and liminf have their usual 
meaning: 

To every bounded sequence {£„} we can make correspond a num- 
ber Lim £„, termed generalized limit, in such a manner as to fulfil 

n 

the following conditions: 

1) Lim (a$ n +brj n ) — a- Lim £ n -)- b-Lim rj n , 

n n n 

2) liminf ( n ^.Lim £ n <lim sup £ n , 

n n n 

3) Lim f n+) = Lim f n . 

n n 

The last condition implies that the generalized limit remains unal- 
tered, when we remove from a sequence a finite number of its terms. 

Let us now write, for every compact open set A, 

(4.1) 1(A) = Lim l n (A). 

n 

We then have, for any compact open sets A and B: 
iiij. 0<Z(A)<-(- 

°°» 

iii t . ACB implies l(A)^l(B); 
iiij. A = B implies l(A)=l(B)-, 

111 4 . + 

111 5 . q(A,B)> 0 implies l(A + B)—l(A)+l(B). 

§5. This being so, we denote, for an arbitrary set XQE, by 
r(X) the lower bound of all the numbers £l(A n ) where {Jl„} is any 

n 

sequence of compact open sets such that XC2jA„. We shall show 

n 

that the function of a set r, thus defined, fulfils the following con- 
ditions: 

1° We have always O^r(X) and there exist sets X for which 
we have 0</’(X)<-f-oo; this is, in particular, the case of all compact 
open sets X; 

2° X,CX, implies r(X l )^r(X t ); 

3° XC£X„ implies 

4® e(Xj,X t )>0 implies r(X 1 +X,)==J 1 (X 1 )+r(X,); 

5® X^X, implies r(X 1 )=/ , (X 1 ). 
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Proof. 1°. Let I be a compact open set. We have, by defi- 
nition, r(X)t^l(X)<+ oo. 

On the other hand, there clearly exists for each e>0 a (finite 
or infinite) sequence of compact open sets {4,,} such that XC}jA n 

n 

and r(X) + e^£l(A„). Let 8 be any sphere contained in A’. Since 

n 

the set S is closed and compact, this set, and a fortiori the set 
is already covered by a finite subsequence {A n ) of {A n }. In view 
of iii 2 and iii 4 , we thus have 

l(S)^l(ZA )^Zl(A n .)^2l(A n )^r( X) + e. 

i 1 i ' n 

Hence, e being arbitrary, it follows that l(8)^r(X), and finally, 
by iiii, that 0<T(X). 

2° and 3° are obvious. 

4°. g(X x ,X 2 )>0 implies that there exist two open sets G 2 
and G 2 such that X X C(? X , X 2 CG 2 and Q(G t ,G 2 )> 0. On the other 
hand, there exists for each ?>0 a sequence of compact open sets (4„) 
such that 

(5.1) X x +A : 2 cZA n and r(X 2 + X t ) + e^2l{A„). 

n n 

Write A i n ) =A„-G 1 and A^= A„ G 2 . Since the sets A ( „° and A„* 
are open and compact, and since their distance, like that of G x and G 2 . 
is positive, we have, on account of iii 5 ‘ and iii 2 , 

(5.2) l(A^) + l{A^)=l(A^+Af)^l(A n ). 

But, since on the other hand and X 2 c£a { „\ we 

n n 

have the inequalities r(X 1 )^^l{A^ ) ) and /’(A 2 X^’l(A < „ 2> ), so that, 

rt n 

bv (5.2), r(Z x )+/’(I,Ki;i(l„). Hence by (5.1), e being arbi- 

n 

trary, we obtain r(X x ) + r(X 2 Kr(X x +X 2 ), and finally by 3°, 

r(z x )+r(A 2 )=r(x x +A 2 ). 

5° follows at once from i 3 and iii 3 . 

$6* It follows from the properties 1° — 4° of the function r 
that the latter is an outer measure in the sense of Oarath^odory 
(cf. Chap. Ill, § 4) and therefore determines in E a class of sets 
measurable (Cr)* that we shall call, simply, measurable sets . We 
see at once that for each set X in E the number r(X) is the lower 
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bound of the measures (F) of the open sets containing X. It follows, 
in particular, that F is a regular outer measure (cf. Chap. II, §6). 

Finally, since the space 2? can be covered by a sequence of 
measurable sets of finite measure (e. g. by a sequence of compact 
spheres), we easily establish, for the measure F, conditions of meas- 
urability (£/) similar to those of Theorem 6.6, Chap. III. In part- 
icular, we shall have: 

(6.1) In order that a set E be measurable , it is necessary and 
sufficient that there exist a set (©<<) containing E and differing from 
it by at most a set of measure zero . 

§ 7. We conclude this note by giving two examples of spaces JE 
with the notion of congruence subject to the conditions of § 2. 

Example 1. Let E be a metrical space which is separable 
and locally compact, and suppose that, among the one to one trans- 
formations, continuous both ways, by which the whole space E 
is transformed into the whole space E , there exists a class dll of 
transformations subject to the conditions: 

1 ) T € dll implies T 1 t dll ; 

2) If T 1 €dli and T 2 edH 1 then T x T 2 edU ; 

3) For every pair a, b of points of E , there exists a transformation 
T e dll such that T(a)=b ; 

4) If (a-,) and are two convergent sequences of points of E 
such that lim = lim , and if is a sequence of transformations 

n n 

belonging to dll such that the sequences {T n (a n )} and { T„(b n )) are con- 
vergent also , then we have lim T n (a n ) = lim T n (b„). 

n n 

Two sets ACE and BCE will be termed congruent , if there 
exists a transformation T e dll such that T(A) = B (where T(A) 
denotes the set into which A is transformed, i. e. the set of all the 
points T(a) for which aeA). 

It is easy to verify that the conditions i, — i 5 are fulfilled. 

As special cases of such spaces E we may mention: Euclidean 
^-dimensional space with dU interpreted as the class of all trans- 
lations and rotations; the 3-dimensional sphere with dU interpreted 
as the class of all rotations. 

Let us observe that, in the space considered, the sets which 
are congruent to open sets are themselves open. On the other hand, 
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on account of 5°, p. 316, the sets congruent to sets of measure (7 1 ) 
zero are themselves of measure zero. It follows therefore from (6.1) 
that in the space E considered the sets which are congruent to meas- 
urable set 8, are themselves measurable. 

Example 2. Suppose that a metrical space E, separable and 
locally compact, constitutes a group, i. e. that with each pair a, b 
of elements of E there is associated an element ab of E, called pro- 
duct, in such a manner that the following conditions are fulfilled: 

1) ( ab)c—a{bc ) (whatever be the elements a, b and c of E); 

2) there exists in E a unit-element 1 such that we have 1 a=a l= a 
for every aeE-, 

3) to each element aeE there corresponds an inverse element 
1 e E which fulfils the equation aa 1 — 1 . 

Suppose further that E fulfils the conditions: 

4) if lim a n = a and lim b„~b, then lim a„b„ — ab-, 

n n n 

5) if lima„ = a, then lima„ ’ = a '. 

n n 

Given any element ceE and any set BCE, we denote by cB 
the set of all the elements aeE such that a—cb where beB. 

Given an element a of E, we write, for every element xeE, 
T„(x) = ax. Thus each element a of E determines a transformation T a , 
clearly one to one and continuous both ways, of the space E into 
itself. Denoting the class of all these transformations by dll, we 
see at once that the conditions 1) — 4), p. 318, are fulfilled. In ac- 
cordance with the definition of congruence employed in Exam- 
ple 1, two sets A and B in the space in question are congruent if 
there exists an element c, such that B=cA. 



NOTH II. 

The Lebesgue integral in abstract spaces 

by 

Stefan Banach. 

Introduction. 

In this note we intend to establish some general theorems 
concerning the Lebesgue integral in abstract spaces. This subject 
has been discussed by several authors (for the references see this 
volume, pp. 4, 88, 116, 156 and 157). Our considerations differ from 
those of other writers in that they are not based on the notion 
of measure. 

Let us fix a set of arbitrary elements H as an abstract space. 
We shall denote real functions (i. e. functions which admit real 
values) defined in H by x(t)j y(t ), z(t),... where te H, or simply by 
x,y, 2 ,... . A set 2 of real functions defined in JET will be called linear 
if any linear combination, with constant coefficients, of two ele- 
ments of 2, also belongs to 2. 

Let 2 be a linear set of functions defined in H. A functional F 
defined in 2 is termed additive if for any pair of elements x and y 
of 2 and any real number a, we have F(x y) = F (x) F (y) and 
F(ax) = a-F{x). The functional F is non-negative if F(x )^ 0 for any 
non-negative function x e 2. 

We say that a functional F defined in 2 is a Lebesgue integral 
(2-integral) in 2 if the following conditions are satisfied: 

A) The set 2 is linear; 

B) the functional F is additive and non-negative; 

C) if 1° C 2 and Jff2, 2° \z„(t)\^M (t) for 1,2,... and 
te H } and 3° lim z n (t)^z(t) for te then zeH and limf^*,,)^/ 1 ^); 

n it 
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D) if ze2, F(z) = 0 and |y(f)|<z(() for teH, then ye £ and 
F(y) = 0; 

E) if 1° {z„)Cfl, z a (l)^Zn+t(t) for n = l, 2 ,..., 2° lim *„(«) = £(<) 

n 

for te H, and 3° lim F(z„)<+oo f then ze£ and lim F(z n ) — F(z). 

n * 

The Lebesgue integrals considered in this note will moreover 
satisfy the condition: 

R) If z e £, then |z|e£. 

In Part I, a condition is established under which an additive 
and non-negative functional defined in a linear set of functions (£, 
may be extended to an r- integral on a certain set £ containing <E. 
The X’-integral and the set £ will be explicitly defined. 

In Part II we admit that H is a metrical and compact space. 
We consider an ^-integral defined in sets containing all functions 
which are bounded and measurable in the Borel sense. It is shown 
that each T-integral of this kind is determined by the values which it 
admits for continuous functions. Conversely, any additive and non- 
negative functional defined for all continuous functions may be 
extended as an f-integral to the class of functions measurable (®). 
We thus obtain the most general r'- integral defined for all functions 
bounded and measurable (“©). 

In Part III we deal with an analogous problem supposing 
that H is the unit sphere of the Hilbert space. In particular, the 
integral of a continuous function is expressed by explicit formulae. 

I. Abstract sets. 

§ 1. We shall employ the following notations: 

1. x^y if x(t)^y(t) for every teH; in particular x^O means 
that x(t)^0 for teH; 

2. |x|=|a;(t)| is the modulus of x(t) in the ordinary sense; 

3. max (x,y)=^(x+y+\x— y\), min {x,y)=l(x+y— \x-y\); 

4. lima?„ = a; means that lim x„(t) — x(t) for teH; the relations 

n " 

lim sup x„ = x f lim inf x n —x are defined similarly; 

n n 

5. j(a?+|x|), a?= — |o?|) (cf. Chap. I, p. 13). 
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§ 2 . For the rest of Part I of this note we shall fix a set d of 
real functions defined in H, and a functional f(x) defined for xed, 
subject to the following conditions: 

(11) The set d is linear; 

(1 2 ) if xed, then |a;|e(E; 

(ii,) the functional / is additive; 

(11 2 ) the functional / is non-negative; 

(11 3 ) if 1° (x„)C(£ and Med, 2° for n— 1,2,..., 

and 3° limx„=0, then lim/(«„) = 0. 

n n 

It follows immediately from the conditions (i) that for any pair 
of elements x and y of d, max (x,y), min (x,y), x and x also be- 
long to (£. It follows further that the condition (ii 3 ) is equivalent to 
the following condition: 

(ii 3 ) If 1° (x„)C(£ and med, 2° .a?„>ra for n — 1,2,..., 
and 3° lim inf ^ 0, then liminf/(ic„)^0. 

n n 

§ 3 . We shall establish the following 

Theorem 1 . If the set G and the functional f satisfy the con- 
ditions (i) and (ii), then there exists an £- integral F , defined in a set 2 
containing G, such that F(x) = f(x) whenever xe(£; moreover , this 
integral satisfies the condition E). 

The proof will result from several lemmas. 

§ 4 . We denote by 2* the set of all functionals z(t) defined 
in H for each of which there exist two sequences {;r„}CG, {t/„)CG 
such that 

(1) lim \rdx n ^z'^\\m sup y n . 

n n 

It is easily seen that the set 2* is linear and that GC 2*. 

Given a function z e 2*, we shall term upper £ integral of z 
the lower bound of ail (finite or infinite) numbers g for each of which 
there exist a function med and a sequence of functions {x n } be- 
longing to G such that x n ^m for n = 1 , 2 ,..., lim inf z and 

n 

</= lim inf f(x n ). 

n 

The definition of the lower £- integral is analogous to that of 
the upper xMntegral. The upper and lower ^-integrals of a function 
**2* will be denoted by p(z) and q(z) respectively. We obviously 
have q(z)——p(—z). 
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*5. The sequence !/(#„)) in the above definition of the upper 
^-integral, may obviously be supposed convergent (to a finite limit 
or +=>o). Further, if {x„)CG, meG, 2 ^ 0 , x„^m for n= 1,2,... and 
lim inf x n ^ 2 , then lim x„=0 and consequently, by the condition 

n n 

(ii 3 ), §2, lim/(x„) = 0. Hence, if ze 2*, 2 > 0 and p(z)<P<-\- oo, 

n 

there always exists a. sequence of non-negative functions {x„} belonging 
to G such that liminfx „^2 and f(x„)<P for n— 1 , 2 ,... 

n 

Lemma 1. For any function xe(£ we have p{x)=f(x). 

Proof. Writing x n —x and m—x , we have 
(1) liminf x n ^x and x n ^m for n— 1 , 2 ,..., 

n 

whence p(x)^.f(x). On the other hand, if x u x 2 , ... and m are 
any functions which belong to G and satisfy the relations (1), then 
liminf (x„—x)^0 and x n —x^m—x for »=1,2,„. . It follows from 

II 

(iiii), §2, that liminf/(.r„— x)>0, i.e. liminf f(x„)^f(x). Thus 

n n 

p(x)>/(x), and finally p(x)~f(x). 

Lemma tl. If 2, <-2*, 2 2 <-2* and if, moreover, 
p(2 2 )<+3o, then p( 2 x + 2 2 Kp(Si)+ p(z 2 )- 

Proof. Let P, and P 2 be arbitrary numbers such that p(z l )<P l 
and p{z 2 )<P 2 . There exist two sequences (x 1 ,, 0 ), of functions 

belonging to G and two functions m, t G and m 2 ^ G such that 
liminf x!/’ > 2 , and lim /(x„ ) )<P y for j — 1,2 and such that 

n n 

for 1,2 and n — 1,2,.... Therefore, writing x„ — ** n( l 

m=w 1 +»» 3 , we have liminf.r„^ 2 j-f- 2 2 and x„ > m for a --1,2,... . ("on- 

ll 

sequently p(z x f- 2 g )^lim/(x„) = lim/(xl I n ) fiim/(x‘, i, )<P l -|- P 2 > whence 

;i n ii 

p(2, + 2 2 )#*p(Zi) + p(z 2 ). 

Lemma !i. For any function ze 2*, we have p(z)^q(z). 

Proof. Since q(z)—--~p(—z) (cf. §4), the inequality p(:)^q(~) 
is obvious if one of the numbers p(s) or p( — 2 ) is + 00 ; while, if 
p ( 2 )< -f -00 and p(— c)<-(-oo, it follows immediately from Lemma 2. 
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Lemma 4. If 2e2*, p(s)<+°°, then also p(£)<+°° and 
p(z)=p(z)+p(e). 

Proof. Given an arbitrary finite number P>p(z), there exist 
a function mt(E and a sequence { x „ J of functions belonging to CE 
such that x n ^m for w=l,2,..., liminf#,,^* and lim/(a? /1 )<P. Note 

n n 

that Xn^iriy and consequently f(x„)^.f{x„)—f(m), for n=l,2,..., 
whence p(«Xlim inf /(£„)< +oo. Again 

n 

P>lim f(x„)^t lim inf / (x„) + iim inf / (x„) > p (z) + p (z), 

n n n 

and therefore p(z)^p(z)-\-p(z)] whence, in virtue of Lemma 2, 

p(z) = p(z) + p(?). 

Finally, we mention two propositions which are directly obvious: 

Lemma 5. If z 1 e2* f z 2 s2* and z l ^.z 2 , then p(z x )^p(z 2 )) in 
particular , if 4re2* and z^ 0 , then p(z)^0. 

Lemma 6\ If ze2* y then p(Xz)=Xp{z) for any non-negative 
number X. 

$6. We shall now denote by 2 the set of all functions ze2* 
for which p(z)=^q(z) 4=00. The following proposition is an immediate 
consequence of Lemmas 2 and 6: 

Lemma 7. If z x e 2 and z 2 e 2, then (X x z x + X 2 z 2 ) e 2 and 
p(X x z x + K z 2 )~Kv( z i)+ A*p(z 2 ) f or an y V a ^ r °f finite numbers X x and X 2 . 

Lemma 8. If ze 2, then \z\*2. 

Proof. Since \z\ —z—z y it is enough to prove that !e2 and 
ze 2. To this end, let us remark that, in virtue of Lemma 4, 
p(z)<+°o, p(z) > — 00 and p(z) = p(z) + p(z) y by symmetry, 
q(z)> — co, g(£)<+oo and q(z) = q(z) J f- q(z). Since, by hypothesis, 
p(z) = q(z ), it follows that [p(z) —q(z)]+[p(z) —q(z)] — 0, and so 
by Lemma 3, p(z) = q(z) + 00 and p(z) = q(z) 4= 00. 

Lemma 9. If z is the limit of a non-decreasing sequence {z„} 
of functions belonging to 2 and lim p(z„)<+oo, then zt 2 and 
p(2)=limp(3„). 

n 

Proof. We can clearly assume (by subtracting, if necessary, 
the function z x from all functions of the sequence {2,,}) that z x — 0. 
Writing to„=z„ +t — z„ for n— 1,2,..., we shall now follow an argu- 
ment similar to that of Theorem 12.3, Chap. I. First, we have z^-z„ 
and p(z„) = q(z„) for every n, and so 

q (z) > lim q(z„) = lim p(z„). 

n n 


(i) 
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To establish the opposite inequality, let e be an arbitrary 
positive integer and let us associate (ef. the remark at the begin- 
ning of § 5) with each function w n a sequence . of non-negative 

functions belonging to (£ such that 

(2) liminf and (3) /(xj, /r) )<p(w /l ) + e/2'\ 

k 

k 

Let us write The functions y k clearly belong to (£ and, 

n 1 

by (2), we have iim inf y k 7z2u' h = z. On the other hand, in virtue 

h k 

k 

of (3), we find f(y k )^^p(w n ) + e < p(z* +l ) + t < lim p(z) e for 

n - 1 1 

k- 1,2,.... Therefore, p(z) < lim inf f(y k ) < lim p(z k ) -f- e, and 

k k 

since e is an arbitrary positive number, this combined with (1) gives 
0^.p(z) — q(z) lim p(z k )< + ^o, which completes the proof. 

k 

Lemma 10. If M e 2 and is a sequence of functions belonging 
to 2 such that for n~l, 2 ,..., then , putting g^= lim inf z„ and 

h — lim sup 2„, we have ge 2, he 2, and 

n 

p (</);$; lim inf p(z„Xlim sup p(z„)^_p(h). 

n n 

Consequently, if the sequence (z„} is convergent and z = lim z„, 
then p(z) = lim p(z„). ” 

n 

Proof. The lemma corresponds to Theorem 12.11, Chap. 1, 
and its proof is analogous to that of the latter. Let us write, for 
each pair of integers i and j>i, g lj = min (z ( .,z ( + z ; ). The se- 
quence {p } j+1 is non-increasing, and consequently the sequence 

{M —gi/)j , , + l is non-decreasing. Let g, = lim gij. Since the func- 
tions g tJ clearly belong to 8, it follows from Lemma 9 that 
M—g,e 8 and p(M—g i ) = limp(M—g ij ), i. e. g t e 8 and p(<jr ( )=limp(p y ). 

Hence, applying again Lemma 9 to the non-decreasing sequence {g) 
which converges to g, we obtain ge 8 and 

p(0) = lim p (</,)< lim inf p(z,). 

i i 

By symmetry we have the analogous result for h and the 
proof is complete. 

We shall conclude this § by mentioning the following lemma 
which is an immediate consequence of Lemma 5: 

Lemma 11. If ze2, 2>0 and p(z) = 0, then any function x 
such that |x| belongs to 8 and for any such function x we have p (x) = 0. 
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§7. Let F(x) = p(x) for x til. The lemmas of the preceding 
sections show that the set 2 and the functional F(z) satisfy the 
theorem stated in §3. Theorem 1 is thus proved. 

It is easily seen that if an r- integral F t defined in a linear 
set 2,1)8 satisfies the condition f(x) = F 1 (x) for xed, then 
F{x)—F 1 {x) for all xe2. Consequently the functional / determines 
completely an f- integral in the set 2. 

n. Metrical compact sets. 

§8. Let now H be a complete and compact metrical space. 
We shall specify ® as the set of functions continuous in H. 

The set (£ satisfies evidently the conditions (i), § 2. It may 
be shown that any additive and non-negative functional / defined 
in ® satisfies the condition (ii 3 ) 1 ). 

Theorem 1 permits to define a Lebesgue integral F(x ) for all 
functions x belonging to a certain set 2D®, in such a manner that 
the condition R), p. 321, is satisfied and that F(x) = f(x) for xed. 

Evidently, every function x(t) which is constant on H belongs 
to d. It follows by condition C), p. 320, that every bounded function 
measurable in the sense of Borel belongs to d. 

We have thus proved the following 

Theorem 2. Every additive and non-negative functional , defined 
for ali functions which are continuous in a complete compact space H 1 
may by extended to an £-integral defined in a certain linear set ( con- 
taining all bounded functions measurable in the sense of Borel) so 
that the condition R) be satisfied. 

The values of this A’-integral for functions bounded and meas- 
urable (93) are, of course, determined by the given functional /. 
Hence the most general ^-integral defined for this class of functions 
may be obtained by. choosing an arbitrary additive non-negative 
functional defined for all functions which are continuous in H and 
by extending this functional by means of the method described in 
Part I of this note. 


l ) A functional of thin kind is necessarily linear. Every linear functional 
defined in <£ satisfies the condition (ii s ). See 8. Banach [I, p.224]. 
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Any linear functional f(x) defined in the set E is the difference 
of two additive non-negative functionals f x (x) and f 2 (x) (cf. S. Ba- 
nach [I, p. 217]). Extending these functionals by means of Theorem 1 
over two sets, 2 X and 2 2 say, respectively, we see that it is possible 
to extend the functional f(x) over the linear set 2=2 1 -2 a . This set 
will contain all bounded functions measurable (53). The extended ad- 
ditive functional F(x) evidently satisfies the conditions C) and R), 
p. 321, and is non-negative. 

ID. The Hilbert space. 

§». We shall now understand by H the unit sphere of 
the Hilbert space, i. e. the set of all sequences {$,) for which 

oo 

£$^1. The distance of two points <=(#,} and is defined, 

as usually, by the formula 


oo 

i=i 

With regard to this definition of distance the space H is 
not compact and therefore we cannot apply Theorem 2 directly. 

Let <£„ be the set of functions x=x (t) = x(d l ,d 21 ...) which are 
continuous in H and whose values depend only on the first n co- 
ordinates &j, so that #(0i, #*,... ) = #(#i, # 2 ,. 0,0,...) for any 
t={&t}e H. Clearly (E„C(E „+i. 

oo 

It is easily seen that the set (£==£(£„ satisfies the conditions 

#»-— 1 

(i), § 2. Any functional / defined in (E for which the conditions (ii) 
hold may be extended to an X- integral defined in a certain set 2 
containing (£. 

Lemma 12. The net 2 eontaim all bounded functions mesurable 
(53) defined in H. 

Proof. Let x be a bounded continuous function defined in H. 
For any point t = (0,,0 2 ,...,0„,...) and any positive integer «, we 
write #„(f) = #(0 i,...,0 n ,O,O,.„). Evidently x n c<& and lim#,,=#. If 

n 

M is the upper bound of |#ft)| for t(H, then Since the 

constant function z—M certainly belongs to <E, it follows from the 
condition C), p. 320, that are 2. 
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Consequently every bounded and continuous function belongs 
to £ and by the condition C) the same is true of any bounded func- 
tion measurable (SB). 

Lemma 13. Every additive and non-negative functional f(x) 
defined in (E satisfies the condition (ii 3 ), § 2. 

Proof. We define in H a distance /') of two points 
t' = by 


( 1 ) 


qi (MT 


= yi- 

" 2 1 1 
i—i 


+ 1#.-#; 


We easily verify that with regard to this distance the set II 
is complete and compact. 

Let d be the set of all functions defined in // which are 
continuous according to the distance defined by the formula (1). 
Evidently (EC(£. 

Let / be an additive non-negative functional defined in (£. 
Let x„(t) = x{9 19 0, 0, ...) for xe (E and t=(& 19 &?,...)€ H. 

With regard to the distance (1), H is a complete and compact 
space, and hence the function x(t)ed is uniformly continuous. It fol- 
lows that the sequence {#„} uniformly converges to x . This implies 
the convergence of the sequence {f(x„)) x ). Let f(x) — lim f(x n ). 

n 

If x^O, then x n ^0 for each w, and consequently f(x)^0. The 
functional f(x ), clearly additive, is therefore non-negative. The set J 3T 
being compact, it follows, by what has been established in Part II, 
that / satisfies in H the condition (ii 3 ) (with (E and f replaced by (E 
and / respectively). Since (E C (E and f(x) = f{x) for o:e(E, the 
functional / satisfies the condition (ii 3 ) in (E. 


§ 10 . Now consider an additive non-negative functional f(x) 
defined in (E. Let /„( x) denote the functional defined in (£„ by the 
formula 

( 2 ) f n (x)^f(x) for xed„. 

We obviously have 

( 3 ) f n (x) = f n + i(x) for xed n . 


1 ) Indeed if t > 0, there exists a positive integer N such that — e ^x p — 
whenever p>N , q>N. Since the constant function 2=1 belongs to (£, we have, 
for k=f( 1), the inequality ^.kr which proves the convergence 

of {/(*„)}• 
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Conversely, if we choose any sequence {/„(#!} of additive non- 
negative functionals, the functional f„ being defined in (£„ (where 
n=l,2,...) subject to the condition (3), then the formula (2) deter- 
mines an additive non-negative functional f(x) in (£. We thus obtain 
the most general additive non-negative functional f(x) defined in (£, 
and by what has been established in the preceding §, the most 
general Lebesgue integral for all functions bounded and measurable (©). 

The set d„ may be interpreted as the set of all function of n 
variables which are defined and continuous in the sphere 

#i+ It is known that the most general additive and non- 

negative functional defined in (£„ may be represented by a Stjeltjes 
integral. 

These general considerations will now be illustrated by the 
following example. Suppose that the functionals f„ are given by 
the formula 

(4) f n (x)= 


for Xfd n , where <p n denotes a fixed non-negative function integrable 
in the sphere #i + ... + #n<l. The condition (3) may be written in 
the form 



^n’^n + J^n+r 


To satisfy this condition, we may put, for instance, <p, = l/2 
and 9 ? n+1 — <p n /2 J'l— ...— & for n^l. We thus obtain 


(5) 






Let x be an arbitrary function bounded and continuous in H. 
We write again x n =x(& if 0,0,...). If |sc|<M, where M is 
a constant, then lim x„=x, |£c„|<Jlf. 
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Now let F be an ^-integral which for functions belonging 
to d coincides with the functional / subject to (2). We then have 
F(x) = ]im F(x„) = lim If further /„ is represented by the 

n n 

formula (4), then 

F(x) = Urn f...f *(3, , ..., 3,0, 0, ...) <p n (9 t , ..., 3J d3, ... d3 
*?+• ••+**<» 


and, in particular, if <p n is given by (5), 

F(x) = lim f ... fx(9 1 , . ., 3„, 0, 0, ...) .. -- . 

n J 2 Kl-35.Jl-tf-...-3*_i 




This formula defines explicitly a certain x’-integral for all func- 
tions bounded and continuous in H. 

The above considerations may be extended to certain spaces 
of the type (B) (cf. S. Banach [I, Chap. V]), e. g. the spaces t p \ 
l} p) with p>l. 
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Ahnost , 16. 

Angle , 310. 

Area , of an interval, 59 ; Elementary, 
of a polyhedron, 165; of a continuous 
surface, 165. 

Boundary of a set , 40. 

Centre , of an open sphere, 40; of a clos- 
ed sphere, 40. 

Chain of points, 122. 

Classes of sets , Additive, 7; Completely 
additive, 7; Additive in the weak sense, 
7; Complete with respect to a meas- 
ure 86. 

Closure of a set , 40. 

Coefficients of an approximate differen- 
tial 300. 

Complement of a set , 6. 

Condition , of Lipschitz for functions of 
two variables on an interval, 169; (1+), 
192; (17), 192; (l a ), 192; Lusin’s, (N), 
224; of Lipschitz for functions of a 
real variable on a set, 264; (T\), 277; 
(T t ), 277; (S), 282; (D), 290; of Lip- 
schitz for functions of two variables 
on a set, 304. 

Contingent , 263. 

Coordinates of a point , 56. 

Cubes , 57; Open, 57; Half open, 57. 

Covering in the sense of Vitali , 109. 

Curves , 121; rectifiable on an interval, 
122; Rectifiable, 122. 

Decomposition , Jordan, of a function 
of a set, 11; Jordan, of a function of 
an interval, 62; Lebesgue, of a func- 
tion of a set, 35; Lebesgue, of a func- 
tion of an interval, 120. 

Densities, Outer upper and lower, 128; 
Upper and lower, 129; Right-hand 
and left-hand, 215. 


Derivates, General upper and lower, 106; 
(Ordinary) upper and lower, 106; 
Strong upper and lower, 106; Unique, 
106; Extreme, 106; Bilateral, 108; Uni- 
lateral (right- and left -hand) 108; Dini, 
108; relative to a set, 108; with res- 
pect to a function, 108; Intermediate, 
108; Symmetrical, upper and lower, 
149; with respect to a measure and a 
sequence of nets in a metrical space, 
154; with respect to a sequence of 
nets of intervals, 188; Approximate 
right-hand and left-hand, upper and 
lower, 220; Approximate bilateral, 
upper and lower, 220; Opposite, 269. 

Derivative , General, 106; Ordinary, 
106; Strong, 106; Unilateral, 108; 
relative to a set, 108; with respect to 
a function, 109; with respect to a mea- 
sure and a sequence of nets in a me- 
trical space, 154; with respect to a se- 
quence of nets of intervals, 188; of 
a complex function, 196; Roussel, 
210; Approximate, 220. 

Diameter, 40. 

Difference of sets, 5. 

Differeyitial, Total, 300; Approximate, 
300; Upper and lower, 309; Extreme, 
309. 

Direction of a half-line, 262. 

Distance , of points, 39; of a point and 
a set, 40; of sets, 40; of points in an 
Euclidean space, 56; of points in the 
torus space, 157. 

Element of a set , 4. 

Expressions of die Gedcze , 171. 

Faces of an interval, 57 ; of a polyhedron. 
164. 

Families of sets, see Classes of sets . 

Figure, 58, Elementary, 58. 
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Function of singularities , of a function 
of a set, 35; of a function' of an inter- 
val, 120. 

Functions {of a complex variable ), Com- 
plex 195; Holomorphic. 195; Deri- 
vable, 196. 

Functions {of an interval ), 59; Contin- 
uous, 59; continuous and discontin- 
uous at a hyperplane, 60; Additive, 
61 ; of bounded variation. 61 ; Monoto- 
ne non -decreasing and non-increasing, 
61; Absolutely continuous, 93; Sin- 
gular, 93; Major and minor, 191; Maj- 
or and minor, with respect to a func- 
tion, 207. (See also Functions {of a real 
variable).) 

FnncUons{ofa point). Finite 6; Character- 
istic, of sets, 6; Simple, 7; Measurable 
12; Integrable, 20; Upper and lower 
semi-continuous, 42; Continuous, 42; 
integrable in the Lebesgue-Stietjes 
sense, 65; integrable in the Lebesgue 
sense, 65; Sumrnable, 65; Approx- 
imately continuous, 132; Equi-mea- 
surable, 143; Cylindrical, 159 (See 
also Functions {of an interval) and 
Functions {of a real variable).) 

Funt turns {of a real variable ), 96; of 
hounded v iriation, 96; Absolutely 
continuous, 96; Singular, 96; Regular, 
97; Sultus-. 97; integrable in the 
sense of Newton, 186; ^’-integrable, 
201; integrable in the sense of Per- 
ron, 201; Approximately derivable, 
220; of bounded variation (in the 
wide sense) 221; VB, 221; of genera- 
lized bounded variation (in the wide 
sense), 221; VBO, 221; absolutely 
continuous (in the wide sense), 223; 
AC, 223; generalized absolutely con- 
tinuous (in the wide sense), 223; ACG, 
223; of bounded variation in the res- 
tricted sense, 228; VB*, 228; of gene- 
ralized bounded variation in the res- 
tricted -eii-a*, 228; VUG*, 228; abso- 
lutely continuous in the restricted 
sense, 231; AC*, 231; generalized ab- 
solutely continuous in the restricted 
sense, 231; ACC*, 231; -^-integrable, 
211; ‘A* -integrable, 241; continuous 
in the sense of Darboux, 272; Inner 
and outer, of a superposition, 286; 
Invei>e, 286. (See also Functions {of an 
interval) and Functions {of a point).) 

Functions (of a set), Additive, 8; Mono- 
tone. S; Non decreasing and non-in- 
<* reading, s ; Absolutely continuous 
3n. 66, Singular, 30, 66. 


Functions (of two real variables ), of bound- 
ed variation, 169; Absolutely contin- 
uous, 169; Totally differentiable, 
300; Differentiable, 300; Approx- 
imately differentiable, 300. 

Graph, of a function, 88; of a curve, 121 . 

Half -plane, 263, 264. 

Half-space , 263, 306. 

Half -tangent. Intermediate, 262. 

Hyperplane, 57; orthogonal to an axis, 
57; Intermediate tangent, 263; Ex- 
treme tangent, 26 3; Unique tangent. 
263; Tangent, 263. 

Increment of a function , 96. 

Integrals, Definite, 19, 20, 46, 254; In- 
definite, 29, 254; Lebesgue-Stieltjes, 
65; Lebesgue, 65; Upper and lower 
Burkill, 166; Definite and indefinite 
Burkill, 166; Mean value, 178; Perron, 
201, y-, 201; Perron-Stieltjes, 207; 

207; Denjoy, in the wide sen- 
se, 241; Denjoy. Khintchine, 241; 

241; Denjoy, in the restricted 
sense, 241; Denjoy -Perron, 241; A,-, 
241; -1-, 241; c s -, 244; Compatible, 
254. 

Interior of a set, 40. 

Intervals, 57; Closed, 57; Open, 57; 
Half -open. 57; Degenerate, 57; in the 
torus space, 157. 

Length, Outer, of a set, 54; of a set 54; of 
an interval, 59; of arc of a curve, 122; 
of a curve, 1-22 

Limit . of a sequence of sets, 5; Upper 
and lower, of a sequence of sets, 5; 
of a sequence of points, 39; Approx- 
imate, upper and lower, 218; Approx- 
imate extreme, 219; Approximate, 
219; Approximate bilateral and uni- 
lateral, 220. 

Lines, Straight, 56, 57. 

Maximum, of a function at a point, 42; 
Strict, 261 . 

Measure, 16; Outer Carathdodory, 43; 
of a set, 46; Outer, of a set, 46; Reg- 
ular outer, 50; Outer Lebesgue, 65; 
Lebesgue, 65; - function, 117; Outer, 
in the torus space, 157; in the torus 
space, 158; Outer linear, of a set on 
a line, 297. 

Mesh of a net, 153. 

Minimum , of a function at a point, 42; 
Strict. 261. 
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Neighbourhood of a pointy 40. 

Net, of (dosed intervals, 57 ; of half open 
intervals, 57; in a metrical space, 153; 
Normal, of intervals, 188. 

Nucleus of a determining system, 47. 

Numbery Characteristic, of a family of 
sets, 40. 

Operation (A), 47. 

Ordinate-set of a function, 88. 

Oscillation , of a function of a point at 
a point, 42; of a function of an inter- 
val on a set, 60; of a function of an 
interval at a set, 60. 

Parameter , of regularity of a set, 106; 
of a curve, 121. 

Part, Common, of sets, 5; Non -negative 
and non-positive, of a function, 13; 
Absolutely continuous, of a function 
of an inteival, 120; Real and ima- 
ginary, of a complex function, 195. 

Plane , 56, 57; Whole, 264. (See also 
Hyperplane.) 

Points , of a space, 6; of accumulation, 
40; Isolated, 40; Internal, 40; of a cur- 
ve. 121; of outer density, 128; of 
dispersion, 128; of density, 129; Right- 
and left-hand, of accumulation, 215; 
isolated on the right or the left, 215; 
Unilateral (right- and left-hand),* of 
outer density, 215; of linear density 
in the direction of an axis, 298. 

Polyhedron , 164. 

Portion of a set, 41. 

Primitive of Newton, 186. 

Product of sets, 5; Combinatory, 82; (’ar- 
tesian, 82. 

Radius, of a neighbourhood, 40; of an 
open sphere, 40; of a closed sphere, 40. 

Relations, Denjoy, 269. 

Sequences , Convergent, of sets, 5; As- 
cending and descending, of sets, 5; 
Non decreasing and non-increasing, 
of sets, 5; Monotone, of sets, 5; Con- 
vergent, of points, 40; Regular, of 
nets of intervals, 57 ; Regular, of nets 
in a metrical space, 153; of mean 
value integrals, 178; Normal, of nets 
of intervals, 188; Binary, of nets of 
intervals, 191. 


Sets, Empty, 4; Enumerable, 4; Meas- 
urable, 7; Bounded, 40; Derived, 40; 
Closed, 40; Isolated, 40; Perfect, 40; 
Open, 40; Non -overlapping, 40; (J), 
40; ((&), 40; ("£), 41; measurable ( s 8), 
4 1 ; Borcl, 4 1 ; closed in a set, 4 1 ; open 
in a set, 41; everywhere dense in a set, 
41; non-dense in a set, 41; of the first 
and the second category (in a set), 41; 
Separable, 41; measurable with res- 
pect to an outer Caratheodory meas- 
ure, 14; regular with respect to an 
outer Caratheodory measure, 50; of 
zero length, 54; of zero area, 54; of 
zero volume, 54; of finite length, 54; 
of finite area, 54; of finite volume, 54; 
Linear, 56; Plane, 56; Ordinate-, 88; 
Measurable, in the torus space, 158; 
Cylindrical, in the torus space, 159. 

Side of a hyperplane, 263; Empty, 263. 

Space, Abstract, 6; Metrical, 39; Com- 
plete, 54; Euclidean, 56; Torus, 157; 
Whole, 306. 

Sphere, Open, 40; Closed, 40. 

Square , 57. 

Subdivision of a figure . 165. 

Subsets, 4. 

Sum of sets, 5. 

Superposition of functions , 286. 

Surfaces , Continuous, 164. 

System, Determining. 47; Degenerate 
determining, 49. 

Tangent , Intermediate, 263; Extreme, 
263; Unique, 263. 

Torus space , 157. 

Translation, of a set by a vector, 91; 
in the torus space, 158. 

Variation, Relative (upper and lower) 
and absolute, of a function of a set, 10; 
Relative (upper and lower) and ab- 
solute, of a function of an interval, 
61, 62; of a fund ion of a real variable, 
96; of a function of an interval at a set, 
166; Weak, 221; Strong. 228. 

Vertex, of a polyhedron, 164; of an angle, 
310. 

Volume, of an interval, 59; of an interval 
in the torus space, 157. 



NOTATIONS. 


CHAPTER I. e,4; ©,4(254)0, = , 4; .2,5(258); P, + ,—, lim inf, lira sup, 6; 
lim, 5 (39); ((E), (E^, (£„ (where (E is a class of sets), 5; C, 6; c E (where E is a set), 0; 
<t>j 9 E ] ; ...; v n , E n ) (where E v ...» E n are sets and v v ...,v n numbers), 7; W, J£,W, 
10 (61, 96); E, 12; /, j (where / is a function), 13; (9 (where is a class of 
sets, / a function of a point and a measure), 20. 

CHAPTER II. (>,39(40); lim, 39 (5);?, <$, d, 40; S, 41(122, 166); S, 41; A\ I, 4° 
(where A is a set), 42; <E>, 5» 40; 93, 41; M, m, 42; o, 42 (60); 2 r (where Pis an 
outer Carath^odory measure), 44; (P), j/dP (where P is an outer Carath^odory 
measure and / a function of a point), 46; N,47; A, 63. 

CHAPTER III. if, 56; (x v ...,x m ) (where x v ...,x m are real numbers), 60; 
[«,,&, a m ,bj, [o,, 6, ; a m , bj, (a,, 6, ; a m , bj, (a,, 6,, a m , bj (where 
a v b v ..., a m , b m are real numbers), 57; ©, Q, 59; 0,60(254); o, 60(42); 

W,W,W, 61 (10, 96); U* (where U is a function of an interval), 64; jfdU (where/is 
a function of a point and U a function of an interval), 64; L, 1 1, 65 (117); $f(x)dx 
(where / is a function of a point), 66; x, 76 (82); U V (where U and V are functions 
of an interval), 76; x, 82(76); 9E9) (where 9E and 9) are classes of sets), 82; f*v 
(where f* and v are measures), 86; (where T is a class of sets and i a measure), 
87; A, 88; B, 88 (121); Q ^ (where Q is a set and a a point), 91; W, W, W, 96 
(10,61); /(o-f), /(a — ) (where / is a function and a a point), 97; F(I) (where 
F is a function of a real variable and I an interval), 99; F[E] (whore F is a func- 
tion of a real variable and E a set), 100. 

CHAPTER IV. r, D, D, D, 106 (154); 0', $ s , $ s , <F S , 106; P 4 ,P+,P",P ”, 

F E ,F E ,F Et Fy,F^,F^, F~jj, F' v (where E is a set, and F, U functions of a real vari- 
able), 108, 109; L, 117(65); p, 121; B, 121 (88); S, 122 (41, 165); f* t f' (where 
/ is a function of a real variable), 144; f (where / is a function of two real vari- 
ables), 147; D sym , D sym , 149; (/u, 9H)D, ([*, 9J1) D, (^, 9W) D (where ^ is a measure 
and 3R a sequence of nets), 154 ( 106); 157; (> m , £' m , (where m is an integer 

and i a point in the torus space), 157; L w , L*,157; 4-, 158. 

CHAPTER V. S 0 , S 165 (41, 122); U (3) (where U is a function of an interval 
and 3 a system of intervals), 165; jU t jU t jU (where U is a function of an inter- 
val), 166; W lf W t< 169; G t , G 2 , G, 171 H x , H t ,H, 174. 

CHAPTER VI. (91)P, (91)F, (91)F' (where 91 is a sequence of nets of inter- 
vals and F a function of an interval), 188; Jj, J 2 , J, 195; Oj, 201; (FS)j 9 208. 

CHAPTER VII. ( — oo, a), ( — oo, a], (a, + oo), fa, -f oo) (where a is a real 
number), 215; lim sup ap, lim inf ap, lim ap, 219; F+, P ap , F ap , ^ap’^ap’ ^ap 
(where F is a function), 220; V, 221; V*, 228. 

CHAPTER VIII. (9)/, 241; ($)f t 244; 0,254(60); ©,254 (4); 

£ h , (where c is an integral), 255 and 256; 2 , 258 (5). 


CHAPTER IX. contg, 263; F+^F+^F^F^F^, 

P n ,F' (where F is a function of two variables), 298; liming, limsup^, lim,,; 

A Vy a Yy 


(where E is a set), 304. 
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